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FLUORESCENT NUCLEOBASE CONJUGATES 
HAVING ANIONIC LINKERS 

This application claims the benefit of priority of United States provisional application 
Ser. No. 60/239,660 filed October 11, 2000, which is incorporated herein by reference. 

Field of the Invention 

The present invention relates to fluorescent dye compounds, and to conjugates and uses 
thereof The invention also relates to fluorescent polynucleotide conjugates having improved 
electrophoretic mobilities. 

Introduction 

The analysis of complex mixtures of polynucleotides is important in many biological 
applications. In many situations, it is necessary to separate components of such mixtures to 
detect target polynucleotides of interest, to determine relative amounts of different components, 
and to obtain nucleotide sequence information, for example. 

Electrophoresis provides convenient methods for analyzing polynucleotides. Typically, 
polynucleotides can be separated on the basis of length, due to differences in electrophoretic 
mobility. For example, in a matrix such as crosslinked polyacrylamide, polynucleotides 
typically migrate at rates that are inversely proportional to polynucleotide length, due to size- 
dependent obstruction by the crosslinked matrix. In free solution, polynucleotides tend to 
migrate at substantially the same rates because of their substantially identical mass to charge 
ratios, so that it is difficult to distinguish different polynucleotides based on size alone. 
However, distinguishable electrophoretic mobilities can be obtained in free solution using 
polynucleotides that contain different charge/mass ratios, e.g., by attaching to the 
polynucleotides a polymer or other chemical entity having a charge/mass ratio that differs from 
that of the polynucleotides alone (e.g., see US Patent No. 5,470,705). 

When different polynucleotides can be separated based on length or molecular weight, 
detection can usually be accomplished using a single detectable label, such as a radioisotope or 
fluorophore. However, in complex mixtures or when different-sequence polynucleotides have 
similar or identical mobilities, it is preferable to use two or more detectable labels to distinguish 
different polynucleotides unambiguously. 

In DNA sequencing, it is now conventional to use two or more (usually four) different 

fluorescent labels to distinguish sequencing fragments that terminate with one of the four 
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standard nucleotide bases (A, C, G and T, or analogs thereof). Such labels are usually 
introduced into the sequencing fragments using suitably labeled extension primers (dye-primer 
method) or by performing primer extension in the presence of nonextendable nucleotides that 
contain unique labels (Sanger dideoxy terminator method). Electrophoresis of the labeled 
5 products generates ladders of fragments that can be detected on the basis of elution time or band 
position. 

Under sieving conditions in crosslinked or non-crosslinked matrices, shorter poly- 
nucleotide fragments migrate more rapidly than longer fragments. Usually, the inter-band 
spacing and migration rates of fragments decrease gradually in proportion to increasing length. 

10 However, anomalous migration patterns can occur due to sequence-dependent secondary 
structures within fragments, even in the presence of denaturing agents such as urea. For 
example, poly-G segments often cause band compression that make sequence determination of 

- these regions difficult. Compressed band regions can often be resolved using nucleotide analogs 
such as dITP (2'-deoxyinosine-5'-triphosphate) or 7-deaza-dGTP in the extension reaction 

1 5 instead of dGTP, or by sequencing the complementary strand. 

Anomalous migration patterns may also occur for polynucleotide fragments that contain 
a detectable label, due to interactions between the label and one or more bases in the 
polynucleotide. Such interactions can be particularly problematic when the interactions are 
sequence-dependent, so that different-sequence fragments having the same-lengths may have 

20 significantly different mobilities. This phenomenon can be inconvenient for sequencing, 
especially in automated sequencing methods. Accordingly, there is a need for labeled 
compounds and methods of use to improve electrophoretic performance. 

Summary of the Invention 

25 In one aspect, the present invention provides a conjugate comprising a dye-labeled 

nucleobase of the form: (1) B-L-D, wherein B is a nucleobase, L is an anionic linker, and D 
comprises at least one fluorescent dye, or (2) B-L1-D1-L2-D2, wherein B is a nucleobase, LI 
and L2 are linkers such that at least one of LI and L2 is an anionic linker, and Dl and D2 are 
members of a fluorescent donor/acceptor pair, such that one of Dl and D2 is a donor dye 

30 capable of absorbing light at a first wavelength and emitting excitation energy in response 

thereto, and the other of Dl and D2 is an acceptor dye capable of absorbing the excitation 

energy emitted by the donor dye and fluorescing at a second wavelength in response thereto. 

Each anionic linker may contain one or more anionic groups, such as a sulfonic acid 

moiety, a sulfate monoester, an anionic phosphate, an anionic phosphonate, or a carboxylic acid. 
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In one embodiment, L, LI or L2 contains a phosphate diester moiety whose phosphorus atom is 
located within a chain of linker atoms (bridging position) or can be a substituent attached to a 
chain of linker atoms (non-bridging position). In another embodiment, the linker contains a 
monoanionic phosphonate ester which can be located within the linker chain or attached to the 
5 linker chain. Other embodiments are described further herein. 

In embodiments in which the conjugate has the form B-L1-D1-L2-D2, one of LI and L2 
can be a nonanionic linker. In one embodiment, LI is an anionic linker and L2 is non-anionic. 
For example, when L2 is non-anionic, D1-L2-D2 may comprise structure (a), (b) or (c) below: 
(a) -Dl-R 21 ZiC(0)R22R28-D2 
1 0 (b) -Dl -R 28 R 2 2C(0)Z 1 R 2 i-D2 

(c) -DI-R28R22R28-D2 

wherein: R 2 i is C1-C5 alkyldiyl, Zi is NH, S, or O, R 22 is ethenediyl, ethynediyl, 1,3- 
butadienediyl, 1,3-butadiynediyl, a 5- or 6-membered ring having at least one unsaturated bond 

15 or a fused ring structure having at least one unsaturated bone, and R 28 is a bond or spacer group 
(a linking segment) that links R22 to Dl or D2. In another embodiment, LI can be a nonionic 
linker, of which the following are examples: -OCCH 2 NH- -CsCCH 2 NHC(0)(CH 2 ) 5 NH- 
-C=CC(0)NH(CH 2 ) 5 NH- -CsCCH 2 OCH 2 CH 2 NH- -C=CCH 2 OCH 2 CH 2 OCH 2 CH 2 NH- 
-C=C-CH 2 OCH 2 CH 2 -NH- and -C=C(p-C 4 H 6 )OCH 2 CH 2 NH-. 

20 Fluorescent dyes used in accordance with the invention can include any fluorescent 

compound suitable for the purposes of the present invention. Typically, each dye comprises a 
conjugated, resonance-delocalized or aromatic ring system that absorbs light at a first 
wavelength and emits light at a second wavelength in response thereto. For example, the dyes 
can be selected independently from any of a variety of classes of fluorescent compounds, such as 

25 xanthene, rhodamine, dibenzorhodamine, fluorescein, [8,9]benzophenoxazine, cyanine, 
phthalocyanine, squaraine, or bodipy dye. 

In another aspect, the invention includes a labeled nucleoside triphosphate comprising a 
conjugate of the type described herein. In one embodiment, the labeled nucleoside triphosphate 
is not 3 '-extendable. For example, the labeled nucleoside triphosphate can be a 2',3'- 

30 dideoxynucleotide or 3'-fluoro-2',3'-dideoxynucleotide. In another embodiment, the labeled 
nucleoside triphosphate is extendable and contains a 3'-hydroxyl group. 

In another aspect, the invention includes a polynucleotide comprising a conjugate of the 
type discussed herein. In one embodiment, the conjugate is located in a 3' terminal nucleotide 
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subunit of a polynucleotide, such that the subunit may be extendable or nonextendable. In 
another embodiment, the conjugate is located on a non-terminal nucleotide subunit. 

In a further embodiment, the invention provides a mixture comprising a plurality of 
different-sequence polynucleotides, wherein at least one polynucleotide contains a conjugate as 
5 described herein. In one embodiment, the mixture comprises at least two different-sequence 
polynucleotides which each contain a different conjugate to identify the attached polynucleotide. 
In another embodiment, the mixture comprises four classes of polynucleotides, wherein the 
polynucleotides in each class terminate with a different terminator subunit type that contains a 
distinct nucleobase-dye conjugate to identify the polynucleotides in that class. 

10 The invention also includes a method of identifying one or more polynucleotide(s). In 

the method, one or more labeled different-sequence polynucleotides are formed such that each 
different-sequence polynucleotide contains a unique conjugate as described herein. The one or 
more labeled different-sequence polynucleotides are separated by electrophoresis on the basis of 
size, and one or more different-sequence polynucleotides are identified on the basis of 

1 5 electrophoretic mobilities and fluorescence properties. 

The invention also provides a method of forming a labeled polynucleotide strand, the 
method comprising reacting together (i) a duplex polynucleotide comprising a 3 '-extendable 
strand hybridized to a complementary template strand having a 5' overhang, (ii) a template- 
dependent polymerase enzyme, and (iii) a labeled nucleoside triphosphate containing a 

20 conjugate as described herein, under conditions effective to form a labeled polynucleotide 
containing the conjugate. In one embodiment, the labeled nucleoside triphosphate is 
nonextendable. In another embodiment, the labeled nucleoside triphosphate is extendable. 

The invention also provides a method of sequencing a target polynucleotide sequence. 
In the method, four classes of polynucleotides are formed which are complementary to a target 

25 polynucleotide sequence, by template-dependent primer extension, wherein the polynucleotides 
in each class terminate with a different terminator subunit type that contains a distinct 
nucleobase-dye conjugate to identify the polynucleotides in that class. The resultant 
polynucleotides are separated on the basis of size to obtain a mobility pattern from which the 
sequence of the target polynucleotide sequence can be determined. 

30 The invention also provides kits for performing the various methods of the invention. 

For nucleic acid sequencing, the kit comprises at least one labeled nucleoside triphosphate 
comprising a conjugate described herein. The kit may also include one or more of the following 
components: a 3'-extendable primer, a polymerase enzyme, one or more 3' extendable 
nucleotides which are not labeled with conjugate, and/or a buffering agent. In some 
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embodiments, the kit includes at least one labeled nucleoside triphosphate that is nonextendable. 
In other embodiments, the kit comprises four different labeled nucleoside triphosphates which 
are complementary to A, C, T and G, and each of which contains a distinct conjugate as 
described herein. In yet another embodiment, the labeled nucleoside triphosphates are 
5 nonextendable. hi another embodiment, the labeled nucleoside triphosphates are extendable 
ribonucleoside triphosphates. In another embodiment, the kit comprises at least one labeled, 
nonextendable nucleoside triphosphate comprising a conjugate described herein, and one or 
more of the following components: a 3 '-extendable primer, a polymerase enzyme, and/or a 
buffering agent. 

10 These and other objects and features of the invention will become more apparent from 

the detailed description. 

Brief Description of the Drawings 

Figs. 1A-18B and 22A-22B illustrate exemplary synthetic protocols for preparing 
15 various conjugates in accordance with the invention. 

Figs. 19A and 19B show electropherograms of sequencing ladders terminating with a 
first set of terminators (ddG). 

Figs. 20A and 20B show electropherograms of sequencing ladders terminating with a 
second set of terminators different from those in Figs. 1A and IB (ddA). 
20 Figs. 21 A and 21B show electropherograms of sequencing ladders terminating with a 

third set of terminators (ddA). 

Detailed Description 

The present invention is directed to novel dye compounds and dye conjugates that have 

25 various advantageous properties. The invention has general application in the areas of fluorescent 

nucleic acid analysis, e.g., automated DNA or RNA sequencing, fragment analysis, detection of 

nucleic acid amplification products, detection of probe hybridization in hybridization arrays, 

diagnostic tests, and the like. In one aspect, the invention provides polynucleotides having more 

consistent size-dependent electrophoretic mobilities, such that sequence-dependent anomalies 

30 are reduced or eliminated. The invention finds application in automated sequencing methods 

which rely on uniform, size-dependent electrophoretic mobilities to determine whether low peak 

signals should be included or discarded, and whether overlapping peaks represent fragments of 

the same length. The invention is also useful in sequencing methods that involve the formation 

of 3' dye-labeled sequencing fragments. In addition, the invention can be used in polynucleotide 
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detection and identification methods that rely on absolute or relative migration times or 
migration distances for polynucleotide identification. 

I. Definitions 

5 Unless stated otherwise, the following terms and phrases used herein are intended to 

have the following meanings: 

"Alkyl" refers to a saturated or unsaturated, branched, straight-chain or cyclic 
monovalent hydrocarbon radical derived by the removal of one hydrogen atom from a single 
carbon atom of a parent alkane, alkene or alkyne. Typical alkyl groups include, but are not 
10 limited to, methyl; ethyls such as ethanyl, ethenyl, ethynyl; propyls such as propan-l-yl, propan- 
2-yl, cyclopropan-l-yl, prop-l-en-l-yl, prop-l-en-2-yl, prop-2-en-l-yl, cycloprop-l-en-l-yl; 
cycloprop-2-en-l-yl, prop-l-yn-l-yl , prop-2-yn-l-yl, etc.; butyls such as butan-l-yl, butan-2-yl, 
2-methyl-propan-l-yl, 2-methyl-propan-2-yl, cyclobutan-l-yl, but-l-en-l-yl, but-l-en-2-yl, 2- 
methyl-prop-l-en-l-yl, but-2-en-l-yl , but-2-en-2-yl, buta-l,3-dien-l-yl, buta-l,3-dien-2-yl, 
15 cyclobut-l-en-l-yl, cyclobut-l-en-3-yl, cyclobuta-l,3-dien-l-yl, but-l-yn-l-yl, but-l-yn-3-yl, 
but-3-yn-l-yl, etc.; and the like. Where specific levels of saturation are intended, the 
nomenclature "alkanyl," "alkenyl" and/or "alkynyl" is used, as defined below. In preferred 
embodiments, the alkyl groups are (Ci-Ce) alkyl. 

"Alkanyl" refers to a saturated branched, straight-chain or cyclic alkyl radical derived by 
20 the removal of one hydrogen atom from a single carbon atom of a parent alkane. Typical 
~ alkanyl groups include, but are not limited to, methanyl; ethanyl; propanyls such as propan-l-yl, 
propan-2-yl (isopropyl), cyclopropan-l-yl, etc.; butyanyls such as butan-l-yl, butan-2-yl (sec- 
butyl), 2-methyl-propan-l-yl (isobutyl), 2-methyl-propan-2-yl (*-butyl), cyclobutan-l-yl, etc.; 
and the like. In preferred embodiments, the alkanyl groups are (Ci-C 6 ) alkanyl. 
25 "Alkenyl" refers to an unsaturated branched, straight-chain or cyclic alkyl radical having 

at least one carbon-carbon double bond derived by the removal of one hydrogen atom from a 
single carbon atom of a parent alkene. The radical may be in either the cis or trans 
conformation about the double bond(s). Typical alkenyl groups include, but are not limited to, 
ethenyl; propenyls such as prop-l-en-l-yl , prop-l-en-2-yl, prop-2-en-l-yl, prop-2-en-2-yl, 
30 cycloprop-l-en-l-yl; cycloprop-2-en-l-yl ; butenyls such as but-l-en-l-yl, but-l-en-2-yl, 2- 
methyl-prop-l-en-l-yl, but-2-en-l-yl , but-2-en-l-yl, but-2-en-2-yl, buta-l,3-dien-l-yl, buta-1,3- 
dien-2-yl, cyclobut-l-en-l-yl, cyclobut-l-en-3-yl, cyclobuta-l,3-dien-l-yl, etc.; and the like. In 
preferred embodiments, the alkenyl group is (C2-C6) alkenyl. 
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"Alkynyl" refers to an unsaturated branched, straight-chain or cyclic alkyl radical having 
at least one carbon-carbon triple bond derived by the removal of one hydrogen atom from a 
single carbon atom of a parent alkyne. Typical alkynyl groups include, but are not limited to, 
ethynyl; propynyls such as prop-l-yn-l-yl , prop-2-yn-l-yl, etc.; butynyls such as but-l-yn-l-yl, 
5 but-l-yn-3-yl, but-3-yn-l-yl , etc.; and the like. In preferred embodiments, the alkynyl group is 
(C 2 -C 6 ) alkynyl. 

"Alkyldiyl" refers to a saturated or unsaturated, branched, straight-chain or cyclic 
divalent hydrocarbon radical derived by the removal of one hydrogen atom from each of two 
different carbon atoms of a parent alkane, alkene or alkyne, or by the removal of two hydrogen 

10 atoms from a single carbon atom of a parent alkane, alkene or alkyne. The two monovalent 
radical centers or each valency of the divalent radical center can form bonds with the same or 
different atoms. Typical alkyldiyls include, but are not limited to methandiyl; ethyldiyls such as 
ethan-l,l-diyl, ethan-l,2-diyl, ethen-l,l-diyl, ethen-l,2-diyl; propyldiyls such as propan-1,1- 
diyl, propan-l,2-diyl, propan-2,2-diyl, propan-l,3-diyl, cyclopropan-l,l-diyl, cyclopropan-1,2- 

15 diyl, prop-l-en-l,l-diyl, prop-l-en-l,2-diyl, prop-2-en-l,2-diyl, prop-l-en-l,3-diyl, cycloprop- 

1- en-l,2-diyl, cycloprop-2-en- 1,2 -diyl, cycloprop-2-en-l,l-diyl, prop-l-yn-l,3-diyl, etc.; 
butyldiyls such as, butan- 1,1 -diyl, butan-l,2-diyl, butan-l,3-diyl, butan-l,4-diyl, butan-2,2-diyl, 

2- methyl-propan- 1,1 -diyl, 2-methyl-propan-l,2-diyl, cyclobutan- 1,1 -diyl; cyclobutan-l,2-diyl, 
- cyclobutan- 1,3 -diyl, but- 1 -en- 1,1 -diyl, but-l-en-l,2-diyl, but-l-en-l,3-diyl, but-l-en-l,4-diyl, 2- 
20 methyl-prop- 1 -en- 1,1 -diyl, 2-methanylidene-propan-l,l-diyl, buta-l,3-dien-l,l-diyl, buta-1,3- 

dien-l,2-diyl, buta-l,3-dien-l,3-diyl, buta-l,3-dien-l,4-diyl, cyclobut-l-en-l,2-diyl, cyclobut-1- 
en-l,3-diyl, cyclobut-2-en-l,2-diyl, cyclobuta-l,3-dien-l,2-diyl, cyclobuta-l,3-dien-l,3-diyl, 
but- 1-yn- 1,3 -diyl, but-l-yn-l,4-diyl, buta-l,3-diyn-l,4-diyl, etc.; and the like. Where specific 
levels of saturation are intended, the nomenclature alkanyldiyl, alkenyldiyl and/or alkynyldiyl is 

25 used. In preferred embodiments, the alkyldiyl group is (Ci-C 6 ) alkyldiyl. Also preferred are 
saturated acyclic alkanyldiyl radicals in which the radical centers are at the terminal carbons, 
e.g., methandiyl (methano); ethan- 1,2 -diyl (ethano); propan-l,3-diyl (propano); butan- 1,4-diyl 
(butano); and the like (also referred to as alkylenos, defined infra). 

"Alkyleno" refers to a straight-chain alkyldiyl radical having two terminal monovalent 

30 radical centers derived by the removal of one hydrogen atom from each of the two terminal 
carbon atoms of straight-chain parent alkane, alkene or alkyne. Typical alkyleno groups include, 
but are not limited to, methano; ethylenos such as ethano, etheno, ethyno; propylenos such as 
propano, prop[l]eno, propa[l,2]dieno, prop[l]yno, etc.; butylenos such as butano, but[l]eno, 
but[2]eno, buta[l,3]dieno, but[l]yno, but[2]yno, but[l,3]diyno, etc.; and the like. Where 
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specific levels of saturation are intended, the nomenclature alkano, alkeno and/or alkyno is used. 
In preferred embodiments, the alkyleno group is (Ci-Ce) alkyleno. 

"Heteroalkyl, Heteroalkanyl, Heteroalkenyl, Heteroalkanyl, Heteroalkyldiyl and 
Heteroalkyleno" refer to alkyl, alkanyl, alkenyl, alkynyl, alkyldiyl and alkyleno radicals, 
5 respectively, in which one or more of the carbon atoms are each independently replaced with the 
same or different heteroatomic groups. Typical heteroatomic groups which can be included in 
these radicals include, but are not limited to, -O-, -S-, -O-O-, -S-S-, -0-S-, -NR-, =N-N= -N=N- 
, -N(0)N-, -N=N-NR'-, -PH-, -P(0) 2 -, -0-P(0) 2 -, -SH 2 -, -S(0) 2 -, -SnH 2 - an the like, where each 
R' is independently hydrogen, alkyl, alkanyl, alkenyl, alkynyl, aryl, arylaryl, arylalkyl, 

10 heteroaryl, heteroarylalkyl or heteroaryl-heteroaryl as defined herein. 

"Acyclic Heteroatomic Bridge" refers to a divalent bridge in which the backbone atoms 
are exclusively heteroatoms. Typical acyclic heteroatomic bridges include, but are not limited 
to, any of the various heteroatomic groups listed above, either alone or in combinations. 

"Parent Aromatic Ring System" refers to an unsaturated cyclic or polycyclic ring system 

15 having a conjugated tz electron system. Specifically included within the definition of "parent 
aromatic ring system" are fused ring systems in which one or more of the rings are aromatic and 
one or more of the rings are saturated or unsaturated, such as, for example, indane, indene, 
phenalene, etc. Typical parent aromatic ring systems include, but are not limited to, 
aceanthrylene, acenaphthylene, acephenanthrylene, anthracene, azulene, benzene, chrysene, 

20 coronene, fluoranthene, fluorene, hexacene, hexaphene, hexalene, as-indacene, s-indacene, 
indane, indene, naphthalene, octacene, octaphene, octalene, ovalene, penta-2,4-diene, pentacene, 
pentalene, pentaphene, perylene, phenalene, phenanthrene, picene, pleiadene, pyrene, 
pyranthrene, rubicene, triphenylene, trinaphthalene, and the like. 

"Aryl" refers to a monovalent aromatic hydrocarbon radical derived by the removal of 

25 one hydrogen atom from a single carbon atom of a parent aromatic ring system. Typical aryl 
groups include, but are not limited to, radicals derived from aceanthrylene, acenaphthylene, 
acephenanthrylene, anthracene, azulene, benzene, chrysene, coronene, fluoranthene, fluorene, 
hexacene, hexaphene, hexalene, as-indacene, s-indacene, indane, indene, naphthalene, octacene, 
octaphene, octalene, ovalene, penta-2,4-diene, pentacene, pentalene, pentaphene, perylene, 

30 phenalene, phenanthrene, picene, pleiadene, pyrene, pyranthrene, rubicene, triphenylene, 
trinaphthalene, and the like. In preferred embodiments, the aryl group is (C5-C14) aryl, with (C5- 
C10) being even more preferred. Particularly preferred aryls are phenyl and naphthyl. 

8 
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"Aryldiyl" refers to a divalent aromatic hydrocarbon radical derived by the removal of 
one hydrogen atom from each of two different carbon atoms of a parent aromatic ring system or 
by the removal of two hydrogen atoms from a single carbon atom of a parent aromatic ring 
system. The two monovalent radical centers or each valency of the divalent center can form 
5 bonds with the same or different atom(s). Typical aryldiyl groups include, but are not limited to, 
divalent radicals derived from aceanthrylene, acenaphthylene, acephenanthrylene, anthracene, 
azulene, benzene, chrysene, coronene, ffuoranthene, fluorene, hexacene, hexaphene, hexalene, 
as-indacene, s-indacene, indane, indene, naphthalene, octacene, octaphene, octalene, ovalene, 
penta-2,4-diene, pentacene, pentalene, pentaphene, perylene, phenalene, phenanthrene, picene, 

10 pleiadene, pyrene, pyranthrene, rubicene, triphenylene, trinaphthalene, and the like. In preferred 
embodiments, the aryldiyl group is (C5-C14) aryldiyl, with (C5-C10) being even more preferred. 
The most preferred aryldiyl groups are divalent radicals derived from benzene and naphthalene, 
especially phena-l,4-diyl, naphtha-2,6-diyl and naphtha-2,7-diyl. 

"Aryleno" refers to a divalent bridge radical having two adjacent monovalent radical 

1 5 centers derived by the removal of one hydrogen atom from each of two adjacent carbon atoms of 
a parent aromatic ring system. Attaching an aryleno bridge radical, e.g. benzeno, to a parent 
aromatic ring system, e.g. benzene, results in a fused aromatic ring system, e.g. naphthalene. 
The bridge is assumed to have the maximum number of non-cumulative double bonds consistent 
with its attachment to the resultant fused ring system. In order to avoid double-counting carbon 

"20 atoms, when an aryleno substituent is formed by taking together two adjacent substituents on a 
structure that includes alternative substituents, the carbon atoms of the aryleno bridge replace the 
bridging carbon atoms of the structure. As an example, consider the following structure 




R 2 



Wherein R 1 , when taken alone is hydrogen, or when taken together with R 2 is (C5-C14) aryleno; 

25 and R 2 , when taken alone is hydrogen, or when taken together with R 1 is (C5-C14) aryleno. 

When R 1 and R 2 are each hydrogen, the resultant compound is benzene. When R 1 taken 
together with R 2 is C 6 aryleno (benzeno), the resultant compound is naphthalene. When R 1 
taken together with R 2 is C10 aryleno (naphthaleno), the resultant compound is anthracene or 
phenanthrene. Typical aryleno groups include, but are not limited to, aceanthryleno, 

30 acenaphthyleno, acephenanthryleno, anthraceno, azuleno, benzeno (benzo), chryseno, coroneno, 
fluorantheno, fluoreno, hexaceno, hexapheno, hexaleno, <xs-indaceno, ^-indaceno, indeno, 
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naphthaleno (naphtho), octaceno, octapheno, octaleno, ovaleno, penta-2,4-dieno, pentaceno, 
pentaleno, pentapheno, peryleno, phenaleno, phenanthreno, piceno, pleiadeno, pyreno, 
pyranthreno, rubiceno, triphenyleno, trinaphthaleno, and the like. Where a specific connectivity 
is intended, the involved bridging carbon atoms (of the aryleno bridge) are denoted in brackets, 
5 e.g., [l,2]benzeno ([l,2]benzo), [l,2]naphthaleno, [2,3]naphthaleno, etc. Thus, in the above 
example, when R 1 taken together with R 2 is [2,3]naphthaleno, the resultant compound is 
anthracene. When R 1 taken together with R 2 is [l,2]naphthaleno, the resultant compound is 
phenanthrene. In a preferred embodiment, the aryleno group is (C5-C14), with (C5-C10) being 
even more preferred. 

10 "Arylaryl" refers to a monovalent hydrocarbon radical derived by the removal of one 

hydrogen atom from a single carbon atom of a ring system in which two or more identical or 
non-identical parent aromatic ring systems are joined directly together by a single bond, where 
the number of such direct ring junctions is one less than the number of parent aromatic ring 
systems involved. Typical arylaryl groups include, but are not limited to, biphenyl, triphenyl, 

15 phenyl-naphthyl, binaphthyl, biphenyl-naphthyl, and the like. When the number of carbon 
atoms comprising an arylaryl group is specified, the numbers refer to the carbon atoms 
comprising each parent aromatic ring. For example, (C5-C14) arylaryl is an arylaryl group in 
which each aromatic ring comprises from 5 to 14 carbons, e.g., biphenyl, triphenyl, binaphthyl, 
phenylnaphthyl, etc. Preferably, each parent aromatic ring system of an arylaryl group is 

20 independently a (C5-C14) aromatic, more preferably a (C5-C10) aromatic. Also preferred are 
arylaryl groups in which all of the parent aromatic ring systems are identical, e.g. , biphenyl, 
triphenyl, binaphthyl, trinaphthyl, etc. 

"Biaryl" refers to an arylaryl radical having two identical parent aromatic systems joined 
directly together by a single bond. Typical biaryl groups include, but are not limited to, 

25 biphenyl, binaphthyl, bianthracyl, and the like. Preferably, the aromatic ring systems are (C5- 
C14) aromatic rings, more preferably (C5-C10) aromatic rings. A particularly preferred biaryl 
group is biphenyl. 

"Arylalkyl" refers to an acyclic alkyl radical in which one of the hydrogen atoms bonded 

to a carbon atom, typically a terminal or sp 2 carbon atom, is replaced with an aryl radical. 

30 Typical arylalkyl groups include, but are not limited to, benzyl, 2-phenylethan-l-yl, 2- 

phenylethen-l-yl, naphthylmethyl, 2-naphthylethan-l-yl, 2-naphthylethen-l-yl, naphthobenzyl, 

2-naphthophenylethan-l-yl and the like. Where specific alkyl moieties are intended, the 

nomenclature arylalkanyl, arylakenyl and/or arylalkynyl is used, hi preferred embodiments, the 

arylalkyl group is (C6-C20) arylalkyl, e.g., the alkanyl, alkenyl or alkynyl moiety of the arylalkyl 
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group is (Ci-C 6 ) and the aryl moiety is (C 5 -Cu)- In particularly preferred embodiments the 
arylalkyl group is (C 6 -Ci 3 ), e.g., the alkanyl, alkenyl or alkynyl moiety of the arylalkyl group is 
(C1-C3) and the aryl moiety is (C5-C10). 

"Parent Heteroaromatic Ring System" refers to a parent aromatic ring system in which 
5 one or more carbon atoms (and any necessary associated hydrogen atoms) are each 
independently replaced with the same or different heteroatom. Typical heteratoms to replace 
the carbon atoms include, but are not limited to, N, P, O, S, Si, etc. Specifically included within 
the definition of "parent heteroaromatic ring systems" are fused ring systems in which one or 
more rings are aromatic and one or more of the rings are saturated or unsaturated, such as, for 

10 example, arsindole, chromane, chromene, indole, indoline, xanfhene, etc. Typical parent 
heteroaromatic ring systems include, but are not limited to, arsindole, carbazole, (3-carboline, 
chromane, chromene, cinnoline, furan, imidazole, indazole, indole, indoline, indolizine, 
isobenzofuran, isochromene, isoindole, isoindoline, isoquinoline, isothiazole, isoxazole, 
naphthyridine, oxadiazole, oxazole, perimidine, phenanthridine, phenanthroline, phenazine, 

15 phthalazine, pteridine, purine, pyran, pyrazine, pyrazole, pyridazine, pyridine, pyrimidine, 
pyrrole, pyrrolizine, quinazoline, quinoline, quinolizine, quinoxaline, tetrazole, thiadiazole, 
thiazole, thiophene, triazole, xanthene, and the like. 

"Heteroaryl" refers to a monovalent heteroaromatic radical derived by the removal of one 
hydrogen atom from a single atom of a parent heteroaromatic ring system. Typical heteroaryl 

20 groups include, but are not limited to, radicals derived from acridine, arsindole, carbazole, 0- 
carboline, chromane, chromene, cinnoline, furan, imidazole, indazole, indole, indoline, 
indolizine, isobenzofuran, isochromene, isoindole, isoindoline, isoquinoline, isothiazole, 
isoxazole, naphthyridine, oxadiazole, oxazole, perimidine, phenanthridine, phenanthroline, 
phenazine, phthalazine, pteridine, purine, pyran, pyrazine, pyrazole, pyridazine, pyridine, 

25 pyrimidine, pyrrole, pyrrolizine, quinazoline, quinoline, quinolizine, quinoxaline, tetrazole, 
thiadiazole, thiazole, thiophene, triazole, xanthene, and the like. In preferred embodiments, the 
heteroaryl group is a 5-14 membered heteroaryl, with 5-10 membered heteroaryl being 
particularly preferred. The most preferred heteroaryl radicals are those derived from parent 
heteroaromatic ring systems in which any ring heteroatoms are nitrogens, such as imidazole, 

30 indole, indazole, isoindole, naphthyridine, pteridine, isoquinoline, phthalazine, purine, pyrazole, 
pyrazine, pyridazine, pyridine, pyrrole, quinazoline, quinoline, etc. 

"Heteroaryldiyl" refers to a divalent heteroaromatic radical derived by the removal of 
one hydrogen atom from each of two different atoms of a parent heteroaromatic ring system or 
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by the removal of two hydrogen atoms from a single atom of a parent heteroaromatic ring 
system. The two monovalent radical centers or each valency of the single divalent center can 
form bonds with the same or different atom(s). Typical heteroaryldiyl groups include, but are 
not limited to, divalent radicals derived from acridine, arsindole, carbazole, p-carboline, 
5 chromane, chromene, cinnoline, furan, imidazole, indazole, indole, indoline, indolizine, 
isobenzofuran, isochromene, isoindole, isoindoline, isoquinoline, isothiazole, isoxazole, 
naphthyridine, oxadiazole, oxazole, perimidine, phenanthridine, phenanthroline, phenazine, 
phthalazine, pteridine, purine, pyran, pyrazine, pyrazole, pyridazine, pyridine, pyrimidine, 
pyrrole, pyrrolizine, quinazoline, quinoline, quinolizine, quinoxaline, tetrazole, thiadiazole, 

10 thiazole, thiophene, triazole, xanthene, and the like. In preferred embodiments, the 
heteroaryldiyl group is 5-14 membered heteroaryldiyl, with 5-10 membered being particularly 
preferred. The most preferred heteroaryldiyl groups are divalent radicals derived from parent 
heteroaromatic ring systems in which any ring heteroatoms are nitrogens, such as imidazole, 
indole, indazole, isoindole, naphthyridine, pteridine, isoquinoline, phthalazine, purine, pyrazole, 

1 5 pyrazine, pyridazine, pyridine, pyrrole, quinazoline, quinoline, etc. 

"Heteroaryleno" refers to a divalent bridge radical having two adjacent monovalent 
radical centers derived by the removal of one hydrogen atom from each of two adjacent atoms of 
a parent heteroaromatic ring system. Attaching a heteroaryleno bridge radical, e.g. pyridino, to a 
parent aromatic ring system, e.g. benzene, results in a fused heteroaromatic ring system, e.g., 

20 quinoline. The bridge is assumed to have the maximum number of non-cumulative double 
bonds consistent with its attachment to the resultant fused ring system. In order to avoid double- 
counting ring atoms, when a heteroaryleno substituent is formed by taking together two adjacent 
substituents on a structure that includes alternative substituents, the ring atoms of the 
heteroaryleno bridge replace the bridging ring atoms of the structure. As an example, consider 

25 the following structure 




R 2 



wherein R 1 , when taken alone is hydrogen, or when taken together with R 2 is 5-14 membered 
heteroaryleno; and R 2 , when taken alone is hydrogen, or when taken together with R 1 is 5-14 
membered heteroaryleno. 

30 When R 1 and R 2 are each hydrogen, the resultant compound is benzene. When R 1 taken 

together with R 2 is a 6-membered heteroaryleno (e.g., pyridino), the resultant compound is 
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isoquinoline, quinoline or quinolizine. When R 1 taken together with R 2 is a 10-membered 
heteroaryleno (e.g., isoquinoline), the resultant compound is, e.g., acridine or phenanthridine. 
Typical heteroaryleno groups include, but are not limited to, acridino, carbazolo, p-carbolino, 
chromeno, cinnolino, furano, imidazolo, indazoleno, indoleno, indolizino, isobenzofurano, 
5 isochromeno, isoindoleno, isoquinolino, isothiazoleno, isoxazoleno, naphthyridino, 
oxadiazoleno, oxazoleno, perimidino, phenanthridino, phenanthrolino, phenazino, phthalazino, 
pteridino, purino, pyrano, pyrazino, pyrazoleno, pyridazino, pyridino, pyrimidino, pyrroleno, 
pyrrolizino, quinazolino, quinolino, quinolizino, quinoxalino, tetrazoleno, thiadiazoleno, 
thiazoleno, thiopheno, triazoleno, xantheno, and the like. Where a specific connectivity is 

10 intended, the involved bridging atoms (of the heteroaryleno bridge) are denoted in brackets, e.g., 
[l,2]pyridino, [2,3]pyridino, [3,4]pyridino, etc. Thus, in the above example, when R 1 taken 
together with R 2 is [l,2]pyridino, the resultant compound is quinolizine. When R 1 taken 
together with R 2 is [2,3]pyridino, the resultant compound is quinoline. When R 1 taken together 
with R 2 is [3,4]pyridino, the resultant compound is isoquinoline. In preferred embodiments, the 

15 heteroaryleno group is 5-14 membered heteroaryleno, with 5-10 membered being even more 
preferred. The most preferred heteroaryleno radicals are those derived from parent 
heteroaromatic ring systems in which any ring heteroatoms are nitrogens, such as imidazolo, 
indolo, indazolo, isoindolo, naphthyridino, pteridino, isoquinolino, phthalazino, purino, 
pyrazolo, pyrazino, pyridazino, pyridino, pyrrolo, quinazolino, quinolino, etc. 

20 "Heteroaryl-Heteroaryl" refers to a monovalent heteroaromatic radical derived by the 

removal of one hydrogen atom from a single atom of a ring system in which two or more 
identical or non-identical parent heteroaromatic ring systems are joined directly together by a 
single bond, where the number of such direct ring junctions is one less than the number of parent 
heteroaromatic ring systems involved. Typical heteroaryl-heteroaryl groups include, but are not 

25 limited to, bipyridyl, tripyridyl, pyridylpurinyl, bipurinyl, etc. When the number of ring atoms 
are specified, the numbers refer to the number of atoms comprising each parent heteroatomatic 
ring systems. For example, 5-14 membered heteroaryl-heteroaryl is a heteroaryl-heteroaryl 
group in which each parent heteroaromatic ring system comprises from 5 to 14 atoms, e.g., 
bipyridyl, tripyridyl, etc. Preferably, each parent heteroaromatic ring system is independently a 

30 5-14 membered heteroaromatic, more preferably a 5-10 membered heteroaromatic. Also 
preferred are heteroaryl-heteroaryl groups in which all of the parent heteroaromatic ring systems 
are identical. The most preferred heteroaryl-heteroaryl radicals are those in which each 
heteroaryl group is derived from parent heteroaromatic ring systems in which any ring 
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heteroatoms are nitrogens, such as imidazole, indole, indazole, isoindole, naphthyridine, 
pteridine, isoquinoline, phthalazine, purine, pyrazole, pyrazine, pyridazine, pyridine, pyrrole, 
quinazoline, quinoline, etc. 

"Biheteroaryl" refers to a heteroaryl-heteroaryl radical having two identical parent 
5 heteroaromatic ring systems joined directly together by a single bond. Typical biheteroaryl 
groups include, but are not limited to, bipyridyl, bipurinyl, biquinolinyl, and the like. 
Preferably, the heteroaromatic ring systems are 5-14 membered heteroaromatic rings, more 
preferably 5-10 membered heteroaromatic rings. The most preferred biheteroaryl radicals are 
those in which the heteroaryl groups are derived from a parent heteroaromatic ring system in 

10 which any ring heteroatoms are nitrogens, such as biimidazolyl, biindolyl, biindazolyl, 
biisoindolyl, binaphthyridinyl, bipteridinyl, biisoquinolinyl, biphthalazinyl, bipurinyl, 
bipyrazolyl, bipyrazinyl, bipyridazinyl, bipyridinyl, bipyrrolyl, biquinazolinyl, biquinolinyl, etc. 

: "Heteroarylalkyl" refers to an acyclic alkyl radical in which one of the hydrogen atoms 

bonded to a carbon atom, typically a terminal or sp 3 carbon atom, is replaced with a heteroaryl 

15 radical. Where specific alkyl moieties are intended, the nomenclature heteroarylalkanyl, 
heteroarylakenyl and/or heterorylalkynyl is used. In preferred embodiments, the heteroarylalkyl 
group is a 6-20 membered heteroarylalkyl, e.g., the alkanyl, alkenyl or alkynyl moiety of the 
heteroarylalkyl is 1-6 membered and the heteroaryl moiety is a 5-14-membered heteroaryl. In 
particularly preferred embodiments, the heteroarylalkyl is a 6-13 membered heteroarylalkyl, 

20 e.g., the alkanyl, alkenyl or alkynyl moiety is 1-3 membered and the heteroaryl moiety is a 5-10 
membered heteroaryl. 

"Substituted" refers to a radical in which one or more hydrogen atoms are each 
independently replaced with the same or different substituent(s). Typical substituents include, 
but are not limited to, -X, -R, -O", =0, -OR,-SR, -S", =S, -NRR, =NR, perhalo (Ci-C 6 ) alkyl - 

25 CX 3 , -CF 3 , -CN, -OCN, -SCN, -NCO, -NCS, -NO, -N0 2 , =N 2 , -N 3 , -S(0) 2 0\ -S(0) 2 OH, - 
S(0) 2 R, -C(0)R, -C(0)X, -C(S)R, -C(S)X, -C(0)OR, -C(0)0\ -C(S)OR, -C(0)SR, - 
C(S)SR, -C(0)NRR, -C(S)NRR and -C(NR)NRR, where each X is independently a halogen 
(preferably -F or -CI) and each R is independently hydrogen, alkyl, alkanyl, alkenyl, alkynyl, 
aryl, arylalkyl, arylaryl, heteroaryl, heteroarylalkyl or heteroaryl-heteroaryl, carboxyl, acetyl, 

30 sulfonyl, sulfinyl, sulfone, phosphate, or phosphonate, as defined herein. The actual substituent 

substituting any particular group will depend upon the identity of the group being substituted. 

"Nucleobase" means a nitrogen-containing heterocyclic moiety capable of forming 

Watson-Crick type hydrogen bonds with a complementary nucleobase or nucleobase analog, e.g. 

a purine, a 7-deazapurine, or a pyrimidine. Typical nucleobases are the naturally occurring 
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nucleobases adenine, guanine, cytosine, uracil, thymine, and analogs of naturally occurring 
nucleobases, e.g. 7-deazaadenine, 7-deaza-8-azaadenine, 7-deazaguanine, 7-deaza-8-azaguanine, 
inosine, nebularine, nitropyrrole, nitroindole, 2-amino-purine, 2,6-diamino-purine, 
hypoxanthine, pseudouridine, pseudocytidine, pseudoisocytidine, 5-propynyl-cytidine, 
5 isocytidine, isoguanine, 2-thiopyrimidine, 6-thioguanine, 4-thiothymine, 4-thiouracil, O 6 - 
methylguanine, /V^methyl-adenine, O^-methylthymine, 5,6-dihydrothymine, 5,6-dihydrouracil, 
4-methylindole, and ethenoadenine (Fasman, Practical Handbook of Biochemistry and 
Molecular Biology, pp. 385-394, CRC Press, Boca Raton, Fl (1989)). 

"Nucleoside" means a compound comprising a nucleobase linked to a C-l' carbon of a 

10 ribose sugar or analog thereof. The ribose or analog may be substituted or unsubstituted. 
Substituted ribose sugars include, but are not limited to, those riboses in which one or more of 
the carbon atoms, preferably the 3'-carbon atom, is substituted with one or more of the same or 
different substituents such as -R, -OR, -NRR or halogen (e.g., fluoro, chloro, bromo, or iodo), 
where each R group is independently -H, Ci-C 6 alkyl or C3-C14 aryl. Particularly preferred 

15 riboses are ribose, 2'-deoxyribose, 2',3'-dideoxyribose, 3'-haloribose (such as 3'-fluororibose or 
3'-chlororibose) and 3-alkylribose. Typically, when the nucleobase is A or G, the ribose sugar 
is attached to the N 9 -position of the nucleobase. When the nucleobase is C, T or U, the pentose 
sugar is attached to the N 1 -position of the nucleobase (Romberg and Baker, DNA Replication, 
2 nd Ed., Freeman, San Francisco, CA, (1992)). Examples of ribose analogs include arabinose, 

20 2'-0-methyl ribose, and locked nucleoside analogs (e.g., WO 99/14226), for example, although 
many other analogs are also known in the art. 

"Nucleotide" means a phosphate ester of a nucleoside, either as an independent monomer or 
as a subunit within a polynucleotide. Nucleotide triphosphates are sometimes denoted as "NTP", 
"dNTP" (2'-deoxypentose) or "ddNTP" (2',3'-dideoxypentose) to particularly point out the 

25 structural features of the ribose sugar. "Nucleotide 5'-triphosphate" refers to a nucleotide with a 
triphosphate ester group at the 5' position. The triphosphate ester group may include sulfur 
substitutions for one or more phosphate oxygen atoms, e.g. a-thionucleotide 5'-triphosphates. 

"Polynucleotide" and "oligonucleotide", which are used interchangeably herein, refer to 
linear polymers of natural nucleotide monomers or analogs thereof, including for example, double- 

30 and single-stranded deoxyribonucleotides, ribonucleotides, a-anomeric forms thereof, and the like. 
A polynucleotide may be composed entirely of deoxyribonucleotides, ribonucleotides, or 
analogs thereof, or may contain blocks or mixtures of two or more different monomer types. 
Usually nucleoside monomers are linked by phosphodiester linkages. However, polynucleotides 
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and oligonucleotides containing non-phosphodiester linkages are also contemplated. 
"Polynucleotide" and "oligonucleotide" also encompass polymers that contain one or more non- 
naturally occurring monomers and/or intersubunit linkages, such as peptide nucleic acids (PNAs, 
e.g., polymers comprising a backbone of amide-linked N-(2-aminoethyl)-glycine subunits to which 
5 nucleobases are attached via the non-amide backbone nitrogens. See Nielsen et al., Science 
254:1497-1500 (1991)). Polynucleotides typically range in size from a few monomeric units, e.g. 
8-40, to several thousand monomeric units. Whenever a polynucleotide is represented by a 
sequence of letters, such as "ATGCCTG," it will be understood that the nucleotides are in 5'->3' 
order from left to right and that "A" denotes deoxyadenosine, "C" denotes deoxycytidine, "G" 

10 denotes deoxyguanosine, and "T" denotes thymidine, unless otherwise noted. 

"Nucleotide subunit" or "polynucleotide subunit" refers to a single nucleotide or nucleotide 
analog within a polynucleotide or polynucleotide analog. 

"Phosphate analog" refers to an analog of phosphate wherein one or more of the oxygen 
atoms is replaced with a non-oxygen moiety. Exemplary phosphate analogs including 

15 phosphorothioate, phosphorodithioate, phosphoroselenoate, phosphorodiselenoate, phosphoro- 
anilothioate, phosphotriester, phosphoranilidate, phosphoramidate, alkylphosphonates such as 
methylphosphonates, boronophosphates. 

"Linker" refers to a moiety that links a dye to a substrate such as an oligonucleotide, or 
links one dye to another dye (e.g., links a donor to an acceptor dye). 

20 "Enzymatically incorporatable" means that a nucleotide is capable of being enzymatically 

incorporated onto the terminus, e.g. 3' terminus, of a polynucleotide chain, or internally through 
nick-translation of a polynucleotide chain, through action of a template-dependent or template- 
independent polymerase enzyme. A nucleotide- 5 '-triphosphate is an example of an 
enzymatically incorporatable nucleotide. 

25 "Enzymatically extendable" or "3' extendable" means a nucleotide or polynuceotide that is 

capable of being appended to a nucleotide or polynucleotide by enzyme action. A 
polynucleotide containing a 3' hydroxyl group is an example of an enzymatically extendable 
polynucleotide. 

"Terminator" means an enzymatically incorporatable nucleotide which prevents subsequent 
30 incorporation of nucleotides to the resulting polynucleotide chain and thereby halts polymerase- 
mediated extension. Typical terminators lack a 3 -hydroxyl substituent and include 2',3'- 
dideoxyribose, 2',3'-didehydroribose, and 2',3'-dideoxy-3'-haloribose, e.g. 3'-deoxy-3'-fluoro-ribose 
or 2',3'-dideoxy-3'-fluororibose, for example. Alternatively, a ribofuranose analog can be used, 
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such as 2',3'-dideoxy-P-D-ribofuranosyl, P-D-arabinofuranosyl, 3'-deoxy-C3-D- 
arabinofuranosyl, 3'-amino-2',3'-dideoxy-p-D-ribofuranosyl, and 2%3'-dideoxy-3'-fiuoro-p-D- 
ribofuranosyl (see, for example, Chidgeavadze et al., Nucleic Acids Res., 12: 1671-1686 (1984), 
and Chidgeavadze et al. FEB. Lett., 183: 275-278 (1985)). Nucleotide terminators also include 
5 reversible nucleotide terminators (Metzker et al. Nucleic Acids Res., 22(20):4259 (1994)). 

"Nonextendable" or "3' nonextendable" refers to the fact that a terminator is incapable, or 
substantially incapable, of being extended in the 3' direction by a template-dependent DNA or 
RNA polymerase. 

"Spectrally resolvable" means that two or more dyes have emission bands that are 
10 sufficiently distinct, i.e., sufficiently non-overlapping, that they can be disinguished on the basis 
of a unique fluorescent signal generated by each dye. 

Generally, whenever a compound mentioned in this disclosure contains a positive or 
negative charge, it should be understood that such compound may also be accompanied by a 
suitable counterion that balances the positive or negative charge. Exemplary positively charged 
15 counterions include, without limitation, H*, NH/, Na + , K + , Mg 2+ , trialkylammonium (such as 
triethylammonium), tetraalkylammonium (such as tetraethylammonium), and the like. Exemplary 
negatively charged counterions include, without limitation, carbonate, bicarbonate, acetate, 
chloride, and phosphate, for example. Also, although particular resonance structures maybe shown 
herein, such structures are intended to include all other possible resonance structures. 

20 

H. Conjugates 

In one aspect, the present invention provides compositions that comprise at least one 
dye-labeled nucleobase of the type described herein. Such compositions include not only 
nucleobase-dye conjugates as independent molecules, but also as nucleosides, nucleotides and 
25 polynucleotides containing such conjugates. 

In one embodiment, a dye-labeled nucleobase of the invention has the form B-L-D, 
wherein B is a nucleobase, L is an anionic linker, and D is a fluorescent dye. 

Nucleobase B may be any moiety capable of forming Watson-Crick hydrogen bonds with 
a complementary nucleobase or nucleobase analog, as set forth in the Definition section above. 
30 Typically, B is a nitrogen-containing heterocyclic moiety such as a 7-deazapurine, purine, or 
pyrimidine nucleotide base. In certain embodiments, B is uracil, cytosine, 7-deazaadenine, or 7- 
deazaguanosine. When B is a purine, the linker is usually attached to the 8-position of the purine. 
When B is a 7-deazapurine, the linker to the dye is usually attached to the 7-position of the 7- 
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deazapurine. When B is pyrimidine, the linker is usually attached to the 5-position of the 
pyrrolidine. 

Fluorescent dye D may be any fluorescent dye which is suitable for the purposes of the 
invention. Typically, the fluorescent dye comprises a resonance-delocalized system or aromatic 
5 ring system that absorbs light at a first wavelength and emits fluorescent light at a second 
wavelength in response to the absorption event. A wide variety of such dye molecules are 
known in the art. For example, fluorescent dyes can be selected from any of a variety of classes 
of fluorescent compounds, such as xanthenes, rhodamines, fluoresceins, cyanines, 
phthalocyanines, squaraines, and bodipy dyes. 
10 In one embodiment, the dye is a xanthene- type dye, which contains a fused three-ring 

system of the form: 



This parent xanthene ring may be unsubstituted (i.e., all substituents are H) or may be 
substituted with one or more of a variety of the same or different substituents, such as described 



In the parent xanthene ring depicted above, A 1 is OH or NH2 and A 2 is O or NH2 + . When A 1 is 
OH and A 2 is O, the parent xanthene ring is a fluorescein-type xanthene ring. When A 1 is NH 2 
and A 2 is NH2 + , the parent xanthene ring is a rhodamine-type xanthene ring. When A 1 is NH 2 
20 and A 2 is O, the parent xanthene ring is a rhodol-type xanthene ring. In the parent xanthene ring 
depicted above, one or both nitrogens of A 1 and A 2 (when present) and/or one or more of the 
carbon atoms at positions CI, C2, C4, C5, C7, C8 and C9 can be independently substituted with 
a wide variety of the same or different substituents. In one embodiment, typical substituents 
include, but are not limited to, -X, -R, -OR, -SR, -NRR, perhalo (Ci-C 6 ) alkyl ,-CX 3 , -CF 3 , - 



25 CN, -OCN, -SCN, -NCO, -NCS, -NO, -N0 2 , -N 3 , -S(0) 2 0", -S(0) 2 OH, -S(0) 2 R, -C(0)R, - 
C(0)X, -C(S)R, -C(S)X, -C(0)OR, -C(0)0", -C(S)OR, -C(0)SR, -C(S)SR, -C(0)NRR, - 
C(S)NRR and -C(NR)NRR, where each X is independently a halogen (preferably -F or CI) and 
each R is independently hydrogen, (C1-C6) alkyl, (Ci-Ce) alkanyl, (C]-C 6 ) alkenyl, (Ci-C<$) 




15 



below. 
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alkynyl, (C 5 -C 20 ) aryl, (C 6 -C 2 6) arylalkyl, (C 5 -C 20 ) arylaryl, heteroaryl, 6-26 membered 
heteroarylalkyl 5-20 membered heteroaryl-heteroaryl, carboxyl, acetyl, sulfonyl, sulfmyl, 
sulfone, phosphate, or phosphonate. Moreover, the CI and C2 substituents and/or the C7 and 
C8 substituents can be taken together to form substituted or unsubstituted buta[l,3]dieno or (C5- 
C20) aryleno bridges. Generally, substituents which do not tend to quench the fluorescence of 
the parent xanthene ring are preferred, but in some embodiments quenching substituents may be 
desirable. Substituents that tend to quench fluorescence of parent xanthene rings are electron- 
withdrawing groups, such as -N0 2 , -Br, and -I. In one embodiment, C9 is unsubstituted. In 
another embodiment, C9 is substituted with a phenyl group. In another embodiment, C9 is 
substituted with a substituent other than phenyl. 

When A ! is NH 2 and/or A 2 is NH 2 + , these nitrogens can be included in one or more 
bridges involving the same nitrogen atom or adjacent carbon atoms, e.g., (Ci-Q 2 ) alkyldiyl, (d- 
Ci 2 ) alkyleno, 2-12 membered heteroalkyldiyl and/or 2-12 membered heteroalkyleno bridges. 

Any of the substituents on carbons CI, C2, C4, C5, C7, C8, C9 and/or nitrogen atoms at 
C3 and/or C6 (when present) can be further substituted with one or more of the same or different 
substituents, which are typically selected from -X, -R', =0, -OR', -SR\ =S, -NR'R', =NR\ -CX 3 , 
-CN, -OCN, -SCN, -NCO, -NCS, -NO, -N0 2 , =N 2 , -N 3 , -NHOH, -S(0) 2 0~, -S(0) 2 OH, -S(0) 2 R', 
-P(0)(0") 2 , -P(0)(OH) 2 , -C(0)R', -C(0)X, -C(S)R', -C(S)X, -C(0)0R\ -C(0)0\ -C(S)OR\ - 
C(0)SR', -C(S)SR', -C(0)NR'R', -C(S)NR'R' and -C(NR)NR'R', where each X is independently 
a halogen (preferably -F or -CI) and each R' is independently hydrogen, (Ci-Ce) alkyl, 2-6 
membered heteroalkyl, (C5-C14) aryl or heteroaryl, carboxyl, acetyl, sulfonyl, sulfinyl, sulfone, 
phosphate, or phosphonate. 

Exemplary parent xanthene rings include, but are not limited to, rhodamine-type parent 
xanthene rings and fluorescein-type parent xanthene rings. 

In one embodiment, the dye contains a rhodamine-type xanthene dye that includes the 
following ring system: 




9 1 



In the rhodamine-type xanthene ring depicted above, one or both nitrogens and/or one or more 
of the carbons at positions CI, C2, C4, C5, C7 or C8 can be independently substituted with a 
wide variety of the same or different substituents, as described above for the parent xanthene 
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rings, for example. Exemplary rhodamine-type xanthene dyes include, but are not limited to, the 
xanthene rings of the rhodamine dyes described in US Patents 5,936,087, 5,750,409, 5,366,860, 
5,231,191, 5,840,999, 5,847,162, and 6,080,852 (Lee et al.), PCT Publications WO 97/36960 
and WO 99/27020, Sauer et al, J. Fluorescence 5(3):247-261 (1995), Arden-Jacob, Neue 
5 Lanwellige Xanthen-Farbstoffe fur Fluoreszenzsonden und Farbstoff Laser, Verlag Shaker, 
Germany (1993), and Lee et al, Nucl. Acids Res. 20:2471-2483 (1992). Also included within 
the definition of "rhodamine-type xanthene ring" are the extended-conjugation xanthene rings of 
the extended rhodamine dyes described in US application Serial No. 09/325,243 filed June 3, 
1999. 

10 In another embodiment, the dye comprises a fluorescein-type parent xanthene ring 

having the structure: 
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In the fluorescein-type parent xanthene ring depicted above, one or more of the carbons at 
positions CI, C2, C4, C5, C7, C8 and C9 can be independently substituted with a wide variety 
of the same or different substituents, as described above for the parent xanthene rings. 

1 5 Exemplary fluorescein-type parent xanthene rings include, but are not limited to, the xanthene 
rings of the fluorescein dyes described in US Patents 4,439,356, 4,481,136, 5,188,934, 
5,654,442, and 5,840,999, WO 99/16832, and EP 050684. Also included within the definition 
of "fluorescein-type parent xanthene ring" are the extended xanthene rings of the fluorescein 
dyes described in US Patents 5,750,409 and 5,066,580. 

20 In another embodiment, the dye comprises a rhodamine dye, which comprises a 

rhodamine-type xanthene ring in which the C9 carbon atom is substituted with an orthocarboxy 
phenyl substituent (pendent phenyl group). Such compounds are also referred to herein as 
orthocarboxyfiuoresceins. A particularly preferred subset of rhodamine dyes are 4,7,- 
dichlororhodamines. Typical rhodamine dyes include, but are not limited to, rhodamine B, 5- 

25 carboxyrhodamine, rhodamine X (ROX), 4,7-dichlororhodamine X (dROX), rhodamine 6G 

(R6G), 4,7-dichlororhodamine 6G, rhodamine 110 (R110), 4,7-dichlororhodamine 110 (dRllO), 

tetramethyl rhodamine (TAMRA) and 4,7-dichloro-tetramethylrhodamine (dTAMRA). 

Additional rhodamine dyes can be found, for example, in US Patents 5,366,860 (Bergot et al.), 

5,847,162 (Lee et al.), 6,017,712 (Lee et al.), 6,025,505 (Lee et al.), 6,080,852 (Lee et al.), 

30 5,936,087 (Benson et al.), 6,111,116 (Benson et al.), 6,051,719 (Benson et al.), 5,750,409, 

20 
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5,366,860, 5,231,191, 5,840,999, and 5,847,162, US application Serial No. 09/325,243 filed 

June 3, 1999, PCT Publications WO 97/36960 and WO 99/27020, Sauer et al, 1995, J. 

Fluorescence 5(3):247-261, Arden- Jacob, 1993, Newe Lanwellige Xanthen-Farbstoffe fur 

Fluoresenzsonden und Farbstoff Laser, Verlag Shaker, Germany, and Lee et al., Nucl. Acids 
5 Res. 20(10):2471-2483 (1992), Lee et al., Nucl. Acids Res. 25:2816-2822 (1997), and 

Rosenblum et al., Nucl. Acids Res. 25:4500-4504 (1997), for example. In one embodiment, the 

dye is a 4,7-dichloro-orthocarboxyrhodamine. 

In another embodiment, the dye comprises a fluorescein dye, which comprises a 

fluorescein-type xanthene ring in which the C9 carbon atom is substituted with an orthocarboxy 
10 phenyl substituent (pendent phenyl group). A preferred subset of fluorescein-type dyes are 4,7,- 

dichlorofluoresceins. Typical fluorescein dyes include, but are not limited to, 5- 
j carboxyfluorescein (5-FAM), 6-carboxyfluorescein (6-FAM). Additional typical fluorescein 

dyes can be found, for example, in US Patents 5,750,409, 5,066,580, 4,439,356, 4,481,136, 

5,188,934 (Menchen et al.), 5,654,442 (Menchen et al.), 6,008,379 (Benson et al.), and 
15 5,840,999, PCT publication WO 99/16832, and EPO Publication 050684. In one embodiment, 

the dye is a 4,7-dichloro-orthocarboxyfluorescein. 

In other embodiments, the dye can be a cyanine, phthalocyanine, squaraine, or bodipy 

dye, such as described in the following references and references cited therein: Patent No. 

5,863,727 (Lee et al.), 5,800,996 (Lee et al.), 5,945,526 (Lee et al.), 6,080,868 (Lee et al.), 
20 5,436,134 (Haugland et al.), US 5,863,753 (Haugland et al.), 6,005,113 (Wu et al.), and WO 

96/04405 (Glazer et al.). 

Sometimes, the designation -1 or -2 is placed after an abbreviation of a particular dye, 

e.g., HEX-1. The "-1" and "-2" designations indicate that a particular 5- or 6-carboxy dye 

isomer being used. The 1 and 2 isomers are defined by order of elution (the 1 isomer elutes 
25 before the 2 isomer) of free dye in a reverse-phase chromatographic separation system utilizing a 

C-8 column and an elution gradient of 15% acetonitrile/85% 0.1 M triethylammonium acetate to 

35% acetonitrile / 65% 0.1 M triethylammonium acetate. 

Anionic linker L is designed to have an overall negative charge. Typically, this negative 

charge is provided by one or more negatively charged groups in the linker. If the anionic linker 
30 contains a positively charged group (e.g., bridging or nonbridging amino groups), then the linker 

must also contain a sufficient number of negatively charged groups to ensure that the total 

negative charge in the linker exceeds the total positive charge. In one embodiment, the linker 

contains no positively charged groups. The linker may contain one, two, or more net negative 

charges which may be provided by one, two, or more negatively charged groups. In one 
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embodiment, the linker contains a single negative charge. In another specific embodiment, the 
linker contains two negative charges which may be provided by a single group or two groups. 
In specific embodiments, the overall charge of the linker at pH 9 can be at least -1, -2, -3, or 
greater. Preferably, the overall charge of the linker at pH 9 is at least -1. By way of non- 
5 limiting examples, such anionic groups include: phosphate monoester (-OPO3 2 ), phosphodiester 
of the form -OP(=0)(CQO- (in which the phosphorus and two oxygen atoms are linker chain 
atoms), phosphodiester of the form -OP(=0)(0")(OR) (in which R is a masking group such as 
alkyl, alkenyl, alkynyl, aryl, alkaryl, etc., and the left-most oxygen is bound directly or indirectly 
to the linker chain), phosphonate monoester of the form -Y-P(=0)(0")0- (in which Y is an 

1 0 alkyl carbon, such as CH 2 , an ethene carbon, an ethyne carbon, or a benzene ring carbon, and 
preferably CH2), and Y, the phosphorus atom and the right-hand oxygen atom are linker chain 
atoms), phosphonate monoester of the form -Y-P(=0)(0")(OR) or -OP(=0)(0")(Z) (wherein Y 
is as just described, and R and Z are each masking groups as just defined for R, but none of the 
phosphonate monoester atoms are linker chain atoms), sulfate monoester (-OSO3), sulfonic acid 

15 (sulfonate, -SO3"), and carboxylic acid (carboxylate, -C0 2 "). In addition, anionic groups can 
include groups with a pK a < 10, such as nitrophenolate, thiolate, phenylthiolate, fluorinated alkyl 
alcohol (e.g., perfluoro hydroxymethyl or perfluoro hydroxyethyl), sulfonimides, and squarates. 

Anionic groups within a linker can be classified in various ways. First, anionic groups 
can be divided between bridging and non-bridging groups, depending on whether an anionic 

20 group is within the chain of linker atoms (a bridging anionic group) or is outside the chain of 
linker atoms (a non-bridging group). Examples of bridging anionic groups are phosphodiester 
of the form -OP(=0)(0")0- and phosphonate monoester of the form -CH 2 P(=0)(0~)0- 
Examples of non-bridging anionic groups are phosphate monoester (-OPO3 2 ), phosphodiester 
of the form -OP(=0)(0~)OR (where R is defined as above), phosphonic acid (-PO3 2 "), 

25 phosphonate monoester of the form -CH 2 P(=0)(0")(OR) or -OP(=0)(0")(Z) (where R and Z 
are defined as above), sulfate monoester, sulfonic acid, and carboxylic acid. Accordingly, the 
invention contemplates linkers that contain one or more bridging anionic groups, one or more 
divalent anionic groups, and combinations thereof. 

Anionic groups can also be classified by net formal charge. Examples of groups that 

30 provide a single negative charge at pH 9 include phosphodiester (both bridging and non- 
bridging), phosphonate monoester of the form -CH 2 P(=0)(0~)0- -CH 2 P(=0)(0")(OR), and 
— OP(=0)(0")(Z), sulfate monoester, sulfonic acid, and carboxylic acid. Examples of groups that 
provide a double negative charge at pH 9 include phosphate monoester (non-bridging) and 
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phosphonic acid of the form (-CH2PO3 2 ). Accordingly, the invention contemplates linkers that 
contain singly charged anionic groups, doubly charged anionic groups, and combinations 
thereof. 

Any of a variety of anionic linkers can be used. Typically, a linker between B and D will 
5 have a linker chain length of from about 4 to about 30 linker chain atoms, and typically from 4 
to 20 linker chain atoms, although shorter and longer linkers may also be used. Several 
exemplary anionic linkers are illustrated in the attached Figures and in the compounds prepared 
in the Examples below. 

The junction between a nucleobase B and linker L can be located at any suitable position 

10 on the nucleobase. Preferably, the attachment site on the nucleobase is selected so as not to 
interfere with or eliminate the H-bonding capability of the nucleobase with respect to a 
complementary nucleobase. When B includes a purine nucleobase, the linker is usually attached 
to the N-8-position of the purine. When B includes a 7-deazapurine nucleobase, the linker is 
usually attached to the N-7 -position of the 7-deazapurine. When B includes a pyrimidine base, the 

15 linkage is attached to the C-5-position of the pyrimidine. In a nucleoside, nucleotide, or 
polynucleotide subunit, a purine or 7-deazapurine is usually attached to a sugar moiety via the N-9- 
position of the purine or deazapurine, and a pyrimidine is usually attached to a sugar moiety via the 
N-l -position of the pyrimidine. 

The particular entity by which a linker is connected to a nucleobase can be any chemical 

20 group that is suitable for the purposes of the present invention. A variety of suitable chemical 
groups are known. For example, the terminal chemical group in the linker that is covalently 
attached to the nucleobase can be an acetylene moiety (-OC-), and often is a propargyl moiety 
(-CSCCH2-), since such linkage moieties tend to be particularly compatible with a variety of 
polymerase enzymes used for primer extension. However, non-acetylenic chemical groups are 

25 also contemplated. Examples of suitable terminal groups for attachment to a nucleobase can be 
found in the following exemplary references: 
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Table 1 



Aminopropargyl 


EPO Patent No. 251786B1 (Hobbs et al.) 

US Patents 5,151,507 and 5,047,519 (Hobbs et al.) 

Hobbs et al., J. Org. Chem., 54: 3420 (1989). 


Oxypropargyl or 
aminoethyloxypropargyl 


US Patent 5,821,356 (Khan et al.) 
US Patent 5,770,716 (Khan et al.) 
US Patent 5,936,087 (Benson et al.) 


-(CsC) n -Ar 0 -C=Cp- 
and variants thereof 


US Patent 5,948,648 (Khan et al.) 
US Patent 6,096,875 (Khan et al.) 


Acylethenyl 


US Patent 6,080,852 (Lee et al.) 


-OC-C 6 H 4 - 


US Patent 6,080,852 (Lee et al.) 


Ofher 


US Patent 6,008,379 (Benson et al.) 



The junction between linker L and dye moiety D can be located at any suitable position on 
5 the dye moiety, preferably so that the fluorescent properties of the dye are not adversely 
affected. For a xanthene-type ring, the linker can be joined to any available carbon atom, or to 
one of the nitrogen atoms in a rhodamine-type xanthene ring. For a rhodamine dye or 
fluorescein dye, the substituent positions on the pendent phenyl ring are also available, 
particularly the positions which are para to C9 of the xanthene ring (5 position), or para to the 

10 ortho carboxyl group (6 position). In addition, the particular chemical group by which a linker 
is connected to a nucleobase can be any chemical group that is suitable for the purposes of the 
present invention. A variety of chemical groups and points of attachment on various dyes can be 
found, for example, in US Patents 5,654,442 and 5,188,934 (Menchen et al.), 6,020,481 (Benson 
et al.), 5,800,996 (Lee et al.), 6,025,505 (Lee et al.), 5,821,356 (Khan et al.), 5,770,716 (Khan et 

15 al.), 6,088,379 (Benson et al.), 6,051,719 (Benson et al.), 6,096,875 (Khan et al.), 6,080,868 
(Lee et al.), US patent application Ser. No. 09/325,243 filed June 3, 1999 (Lam et al.), 
09/498,702 filed Feb 7, 2000 (Upadhya et al.), 09/564,417 filed May 2, 2000 (Menchen et al.), 
and Ser. No. 09/433,093 filed November 3, 1999 (Lee et al.). In one preferred embodiment, for 
xanthene derivatives that contain a C9 phenyl group, such as a rhodamine dye or fluorescein 

20 dye, the linker is attached to the dye via a 5-carboxyphenyl (para to the xanthene C9 carbon 
atom) or 6-carboxyphenyl group (meta to the xanthene C9 carbon atom). In another preferred 
embodiment, for xanthene dyes generally, the linker is preferably attached to a 4-carbon atom or 
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5-carbon atom on the xanthene ring. In a third preferred embodiment, for rhodamine-type 
xanthene dyes and rhodamine dyes, the linker is attached to the 3 or 6-nitrogen atom of the 
xanthene ring. Further guidance for forming conjugates of the invention can be found below 
with reference to the Examples herein. 
5 The dye-labeled nucleobase of the invention may also have the form B-L1-D1-L2-D2, 

wherein B is a nucleobase, LI and L2 are linkers such that at least one of LI and L2 is an 
anionic linker, and Dl and D2 are members of a fluorescent donor/acceptor pair, hi one 
embodiment, Dl is a donor dye, and D2 is an acceptor dye. In another embodiment, D2 is a 
donor dye, and Dl is an acceptor dye. For donor/acceptor pairs, it is understood that the donor 

10 dye and acceptor dye have different (non-identical) spectral properties. Thus, although the 
donor and acceptor may have the same type of aromatic ring structure (e.g., when both the donor 
and acceptor are fluorescein dyes, or both are rhodamine dyes), different spectral properties can 
arise for the donor and acceptor due to the nature of the substituents on each one. The donor dye 
is effective to enhance the intensity of fluorescence emission of the acceptor dye relative to the 

15 intensity that would be observed in the absence of the donor dye under the same conditions. 
Conjugates of this form may be referred to herein as "FRET probes", "FRET-labeled 
conjugates" or FRET-labeled nucleotides because upon excitation of the donor dye, the 
conjugate can undergo nonradiative fluorescence resonance energy transfer from the donor to 
the acceptor, such that the acceptor dye can then emit fluorescent light at a second wavelength in 
- 20 response thereto. 

The donor dye and acceptor dye can be any fluorescent dye, and are each preferably 
fluorescent aromatic dyes. For example, the donor and acceptor dye, taken separately, can be a 
xanthene, rhodamine, dibenzorhodamine, fluorescein, [8,9]benzophenoxazine, cyanine, 
phthalocyanine, squaraine, or bodipy dye. Furthermore, the donor and acceptor dyes can be 

25 linked together using any of a variety of attachment sites on each dye. For example, if Dl is a 
fluorescein and D2 is a rhodamine (both of which contain pendent phenyl groups attached to C9 
of the xanthene rings), Dl can be linked via its xanthene ring (preferably via C4)) to the pendent 
phenyl ring of D2 (e.g., via a 5- or 6-carboxy group on the pendent phenyl group). This is 
referred to as a head to tail arrangement. Alternatively, the positions of the connections can be 

30 reversed, such that D2 is linked via its xanthene ring to the pendent phenyl ring of Dl (another 

example of a head to tail arrangement). In other alternatives, Dl and D2 can be connected tail to 

tail, via their pendent phenyl rings, or head to head, via their xanthene rings, for example. 

As noted above, at least one of LI and L2 is an anionic linker. The properties of such 

anionic linkers are generally as discussed for anionic linker L above. 
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In one embodiment, LI is an anionic linker and L2 is a non-anionic linker. For L2, any 
of a variety of non-anionic linkers can be used to connect Dl to D2. General considerations for 
forming donor-acceptor conjugates are discussed in US Patents 5,863,727, 5,800,996, 5,945,526, 
and 6,008,379, for example. In one set of embodiments, D1-L2-D2 may comprise one of 
5 structures (a), (b) or (c) below: 

(a) -D1-R 21 Z,C(0)R 2 2R28-D2 

(b) -D1-R 28 R 2 2C(0)Z 1 R 21 -D2 

(c) -D1-R 28 R 22 R 28 -D2 

10 

wherein: R 2 i is C1-C5 alkyldiyl, Z\ is NH, S, or O, R 22 is an alkene, diene, alkyne, or a 5- or 6- 
membered ring having at least one unsaturated bond or a fused ring structure, and R 2 g is a bond 
or spacer group. Details and examples of such inter-dye linkers can be found in US Patent 
5,800,996, for example. In certain embodiments, R 22 is ethenediyl, ethynediyl, 1,3- 
15 butadienediyl, or 1,3-butadiynediyl. 

In another embodiment, LI is a non-anionic linker and L2 is an anionic linker. In this 
case, any of a variety of non-anionic linkers can be used to connect B to Dl. Descriptions of 
exemplary nonanionic linkers can be found in the references in Table 1 above. For example, LI 
can be or contain any of the following non-limiting examples: 

20 

-OCCH 2 NH- 

-C=CCH 2 OCH 2 CH 2 NH- 

-C=CCH 2 OCH 2 CH 2 OCH 2 CH 2 NH- 

-CsCCH 2 NHC(0)(CH 2 ) 5 NH- 
25 -CsCC(0)NH(CH 2 ) 5 NH- 

-C=CHC(0)NH(CH 2 ) 5 NH- 

-CsC-(p-C 4 H 6 )OCH 2 CH 2 NH- 

-CsC-(p-C6H4)OCH 2 CH 2 NH- 

-C=C-(p-C 6 H4)-(p-C6H4)-<>C- 
30 -C=C-(p-C 4 H 6 )- 

-OC-CsC- 

wherein the left-hand ethene or ethyne moiety is linked to the nucleobase, and the right hand 
bond is typically linked directly to the dye or is linked indirectly to the dye through a carbonyl 
35 group. Additional nonanionic linkers are shown in the exemplary compounds in the attached 
Figures. 
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More generally, non-anionic linkers encompass linkers that are either charge-neutral or 
are positively charged. Charge-neutral linkers refer to linkers that either contain no charged 
groups at pH 7 (i.e., has no charged group having a pKa between 6 and 8), or contain equal 
numbers of positively and negatively charged groups which cancel to provide a net charge of 
5 zero. A positively charged linker is positively charged at pH 7, e.g., due to the presence of an 
ammonium ion or imidazole ion, for example. Preferably, a non-anionic linker is a charge- 
neutral linker. Preferably, the charge-neutral linker contains no charged groups at pH 7. 

When LI and L2 are both anionic linkers, the structures of LI and L2 can be the same or 
different, and the anionic group(s) in the linkers can also be the same or different. Further 
1 0 guidance regarding conjugate structures in accordance with the invention is provided below. 

Compounds of the invention may be prepared by any suitable synthetic method. 
Typically, conjugates of the invention are formed using a modular approach in which a 
nucleobase (which may be provided in the form of a nucleoside or nucleotide containing the 
nucleobase, for example), a first dye, a second dye (if present), and one or more linkers or linker 
15 precursors, are combined in serial and/or parallel steps to produce the desired labeled product. 
Several exemplary approaches are illustrated in the Examples below, which describe syntheses 
of several different dye-labeled nucleotides containing linkers of various lengths and 
compositions. 

Example 1 describes a synthetic method for preparing a labeled nucleotide in accordance 

20 with the invention, which contains a (i) first dye linked via C-8 of a 7-deazaadenine nucleobase 

by an anionic linker that contains a phosphate diester moiety within the chain of linker atoms, 

and (ii) a second dye linked to the first dye by a charge-neutral (non-anionic) linker. The linker 

between the nucleobase and the first dye contains 13 linker chain atoms. The linker between the 

first and second dyes contains 10 linker atoms. In this example, a bifunctional linker moiety 7 is 

25 formed in several steps by first reacting a cyclic phosphoramidite 1 with methyl glycolate 2, 

followed by oxidation, to form phosphate compound 3. After removal of a methyl group from 

the phosphate to produce 4, and deprotection of the amino and carboxyl groups to produce 

compound 5, an Fmoc protecting group is attached to the amine to produce carboxylic acid 6. 

Activation of the carboxylic acid with N-hydroxysuccinimide (NHS) produces ester 7, which is 

30 a versatile linker synthon. Ester 7 is then reacted with 7-aminopropargyl-7-deazaadenosine 

triphosphate 8 to form compound 9. For reaction with compound 9, dye compound 11 was 

prepared by reacting ^-aminomethylbenzoic acid with Fmoc acid chloride to form Fmoc 

protected />-aminomethylbenzoic acid. After activation of the benzoic acid with NHS, the NHS 

ester product was reacted with 4'-aminomethyl-6-carboxyfluorescein to produce the expected 
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amide adduct. The 6-carboxyl group was then reacted with NHS to produce dye compound 11. 
Reaction of compounds 10 and 11 produced adduct 12, in which an anionic (phosphate- 
containing) linker is formed fully between the fluorescein dye and the nucleobase. After 
removal of the Fmoc group, the resultant free amine compound 13 was reacted with rhodamine 
5 NHS ester 14 to form dye-labeled nucleotide 15. 

An alternative method for preparing compound 10 used in Example 1 is provided in 
Example 2. This Example describes a synthetic approach in which a linker synthon 19 
containing a phosphate monoester is prepared for reaction with a nucleoside containing an 
iodinated nucleobase (7-iodo-7-deazaadenine). In brief, 3-amino-l-propyne 16 is reacted with 

10 methyl glycolate 2 to form the expected amide product 17. Reaction of 17 with cyclic 
phosphoramidite 1 followed by oxidation affords phosphate triester 18. After removal of the 
phosphate group, resultant phosphodiester compound 19 is reacted with iodo-nucleoside 20 to 
afford adduct 21. The 5'-hydroxyl group of the nucleoside can be converted to a triphosphate 
group by reaction with phosphorous oxychloride to form dichloromonophosphate 22, followed 

15 by addition of pyrophosphate to provide nucleoside triphosphate 23. Removal of the 
trifluoracetyl protecting group from the terminal amine nitrogen provides synthon 10. 

Example 4 describes synthesis of a synthetic method for preparing a FRET-labeled 
nucleotide which contains a (i) first dye linked to a 7-deazaadenine nucleobase by a charge- 
neutral (non-anionic) linker, and (ii) a second dye linked to the first dye by an anionic linker that 

20 contains a monoanionic sulfonic acid moiety. The linker between the nucleobase and the first 
dye contains 5 linker chain atoms. The linker between the first and second dyes contains 10 
linker atoms. As detailed in Example 4, ^-aminomethylbenzoic acid 24 is reacted with sulfuric 
acid to form the meta-sulfonated product 25. Reaction with Fmoc-succinimide affords Fmoc 
protected amine 26, which is then reacted with N-hydroxysuccinimide to form NHS ester 27. 

25 For reaction with NHS ester 27, dye-labeled nucleoside triphosphate 30 can be prepared by 
reacting aminopropargyl nucleoside triphosphate 8 with dye intermediate 28 (a fluorescein dye 
containing a trifluoroacetyl-protected 4'-aminomethyl group and an NHS ester of a 6-carboxyl 
group) to afford dye-labeled nucleotide 29, followed by removal of the trifluoroacetyl (TFA) 
group to afford amine compound 30. Reaction of NHS ester 27 and compound 30 affords 

30 Fmoc-protected compound 31. After removal of the Fmoc group, resultant amine compound 32 
is reacted with dye NHS ester 14 to afford dye-labeled nucleotide 33. It can be seen that the 
linker between the first and second dyes contains a sulfonic acid group which is attached to a 
benzene moiety in the linker chain. 
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In Example 5, a protocol is described for preparing a labeled nucleotide in which first 
and second dyes of a donor-acceptor pair are linked by a phosphate-containing linker, and the 
donor dye is linked to a nucleobase by a charge-neutral linker. This nucleotide product differs 
from the product of Example 1 since the anionic phosphate linker is located between the first 
5 dye and the nucleobase. The product of Example 5 also differs from the product of Example 4 
since the anionic linker between the two dyes contains a phosphate monoester in the linker 
chain, rather than a sulfonic acid group attached to the linker chain. The compounds also differ 
in the lengths of some of the linkers. 

As detailed in Example 5, aminopropargyl nucleotide 8 is combined with dye NHS ester 
10 11 to afford Fmoc-protected compound 34. Following removal of the Fmoc group, resultant 
amine 35 is reacted with ester 7 to produce Fmoc protected dye-labeled nucleotide 36. After 
removal of the Fmoc group, resultant amine 37 is reacted with dye NHS ester 14 to afford dye- 
labeled nucleotide 38. 

A method for preparing a FRET-labeled nucleotide containing two anionic linkers is 

15 described in Example 6. In particular, the anionic linker between the nucleobase and first dye 
contains a phosphate diester moiety, and the anionic linker between the first and second dyes 
contains a sulfonic acid moiety. In the method described in Example 6, nucleotide amine 10 
~ (Examples 1 and 2) is reacted with dye NHS ester 28 to form dye-labeled nucleotide 39, in 

which the dye and nucleotide are linked by a phosphate-containing linker. Removal of the Fmoc 

20 group produces amine compound 40, which is reacted with sulfonate-containing NHS ester 27 to 
afford Fmoc-protected compound 41. After removal of the Fmoc group, resultant amine 42 is 
reacted with dye NHS ester 14 to afford dye-labeled nucleotide 43. 

Example 7 illustrates how the dye-labeled nucleotide compound 40 from Example 6 can 
be used to form a conjugate of the invention by an alternative route, relative to the route 

25 described in Example 1. In Example 1, the main linker synthon between the first and second 
dyes is provided as part of compound 11, which contains a first dye (for attachment to the 
nucleobase) and an Fmoc-protected linker synthon that is attached to the first dye. In the 
method of Example 7, the linker is provided as part of a compound (44) which contains a linker 
synthon attached to the second dye. This compound can be reacted with dye-labeled nucleotide 

30 40 to obtain desired product 15. Thus, Example 7 illustrates how the order of connection of 

various synthons can be varied, if desired, to synthesize a particular compound of the invention. 

Examples 8A-8B describe methods for preparing two FRET-labeled nucleotides that are 

similar to the product of Example 1 , except that the nucleobase is cytosine. In Example 8 A, the 

linkers are the same as for the product of Example 1 . In Example 8B, the linker between the 
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nucleobase and first dye is longer than the corresponding linker in Example 8A, due to the 
inclusion of an ethoxy group inserted after the propargyl group. 

Examples 9, 10 and 11 describe methods for making additional FRET-labeled 
nucleobases (thymine) which contain selected anionic linkers between first and second dyes. 
5 The method in Example 9 produces a product 64 that has a sulfonate-containing anionic linker 
between the two dyes, and a charge-neutral linker between the first dye and nucleobase. Product 
69 from Example 10 has a phosphate-containing anionic linker between the first and second 
dyes. This anionic linker is also longer than the linker of Example 9 (18 linker chain atoms 
versus 10 linker chain atoms). The method in Example 11 produces a product 77 similar to 

10 product 69, except that the nucleobase is 7-deazaguanosine, and the attached dye is different. 

Methods for preparing several exemplary dye-labeled conjugates of the form B-L-D are 
provided in Examples 12, 13 and 18. Example 12A describes a method for preparing a dye- 
labeled conjugate 82 having an anionic linker that contains a sulfonate group, as illustrated by a 
sulfonated benzene moiety. In Example 12B, dye-labeled conjugate 85 has a dianionic linker 

15 (net formal charge of -2) that contains both a sulfonate group and a phosphate group. In 
Example 13, dye-labeled conjugate 95 has an anionic linker containing a phosphodiester moiety 
within the linker chain. 

Example 18 (see also Figures 22A and 22B) provides a method of forming a conjugate of 
the form B-L-D having a linker that comprises a carboxyl anionic group, as illustrated with a 

20 carboxy benzene moiety as part of the linker. It will be appreciated that carboxylic acid groups 
can also be included in linkers in other ways, by preparing appropriate carboxylated linker 
synthons. 

Examples 14, 15 and 16 provide additional methods for preparing FRET-labeled 
conjugates wherein the nucleobase and first dye are linked by an anionic linker, and the donor 

25 and acceptor dyes are linked by a charge-neutral linker. The nucleobase in Examples 14 and 16 
is 7-deazaadenine. The nucleobase in Example 15 is 7-deazaguanine. In Example 14, a 
phosphodiester moiety is linked to a 7-propargyl group on the nucleobase, and the remainder of 
the linker is provided as an ethylaminoacyl moiety linked to the pendent phenyl ring of a 
fluorescein dye. In Example 1 5, a phosphodiester moiety is linked to the nucleobase by a 7- 

30 propargylphenylpropynyl group, and the remainder of the linker is provided as an 
ethylaminoacyl moiety. In Example 16, a phosphonate monoester is linked to the nucleobase by 
a methylacylaminopropargyl group, and the remainder of the linker is provided as an 
ethylaminoacyl moiety. 
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The foregoing examples illustrate a broad variety of dye-labeled nucleobase compounds 
in accordance with the invention, including four different types of nucleobases, linkers of 
different compositions and lengths, a number of different types of dyes, and various 
combinations thereof. Several specific examples of conjugates of the form B-L1-D1-L2-D2 are 
5 described wherein LI, L2 or both, are anionic linkers of various types and lengths. Specific 
examples of conjugates of the form B-L-D are also provided (Examples 12A, 12B, and 13), and 
modifications should be immediately apparent from the FRET pair Examples in which LI is an 
anionic linker. 

The present invention also includes nucleosides and nucleotides containing conjugates in 
10 accordance with the invention. Particularly preferred nucleosides/tides of the present invention are 
shown below in the following formula: 




W 2 Wi 

wherein Wj is OH, H, F, CI, NH 2 , N 3 , or OR, where R is C1-C6 alkyl (e.g., OCH 3 or 
OCH2CH3); W 2 is OH or a group capable of blocking polymerase-mediated template-directed 

15 primer extension (such as H, F, CI, NH 2 , N 3 , or OR, where R is C1-C6 alkyl (e.g., OCH 3 or 
OCH2CH3)); W3 is OH, or mono-, di- or triphosphate or a phosphate analog thereof; and LB 
(labeled base) represents a dye-labeled nucleobase conjugate of the invention. In one embodiment, 
Wj is not OH. In another embodiment, W 2 is not OH, so that the compound is not 3' extendable. In 
another embodiment, Wj and W 2 are selected from H, F, and NH 2 . In further embodiments, Wi is 

20 F and W 2 is H, or Wi is H and W 2 is F, or Wi and W 2 are each F, or Wi and W 2 are each H. In 
addition, for each of the foregoing embodiments for Wi alone, W 2 alone, and Wj and W 2 in 
combination, it is contemplated that W 3 can be OH, monophosphate, diphosphate, or triphosphate. 
For LB, exemplary nucleobases include adenine, 7-deazaadenine, 7-deaza-8-azaadenine, cytosine, 
guanine, 7-deazaguanine, 7-deaza-8-azaguanine, thymine, uracil, and inosine. 

25 For example, in one particular embodiment, when W 3 is triphosphate, the present invention 

includes nucleotide triphosphates having the structure shown in the formula below: 
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X H 



wherein X is H or F. Such terminator nucleotides, and others discussed above which lack a 3' OH 
group, find particular application as chain terminating agents in Sanger-type DNA sequencing 
methods utilizing fluorescent detection, and also in minisequencing. 
5 hi another embodiment, the invention includes deoxynucleotide triphosphates having the 

structure shown in the formula below: 

fl tf fl 

'O— p— o— p— o— p— o— ™ T ° 
O" O" O" 

OH H 

wherein LB is defined as above. Such compounds are examples of 3' extendable nucleotides. 
Labeled 2'-deoxynucleotides of this type find particular application as reagents for labeling 
1 0 polymerase extension products, e.g., in the polymerase chain reaction and nick-translation. 

In yet another embodiment, the invention includes ribonucleotide triphosphates having the 
structure shown in the formula below: 




It II II 

o- o- o- JQ| 

OH OH 

15 wherein LB is defined as above. Labeled nucleotides of this type find particular application as 
reagents for and in sequencing methods that utilize labile nucleotides having cleavable 
internucleotide linkages, as discussed for example in US Patent 5,939,292 (Gelfand et al.), 
Eckstein, Nucl. Acids Res. 16:9947-9959 (1988), and Shaw, Nucl. Acids Res. 23:4495 (1995). 

The invention also provides polynucleotides and mixtures of polynucleotides that contain 
20 one or more different nucleobase-dye conjugates of the type discussed above. Such 
polynucleotides are useful in a number of important contexts, such as DNA sequencing, ligation 
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assays, the polymerase chain reaction (PCR), probe hybridization assays, and various other 
sequence detection or quantitation methods. 

Dye-containing polynucleotides (also referred to herein as labeled polynucleotides) may be 
synthesized enzymatically, e.g., using a DNA or RNA polymerase, nucleotidyl transferase, ligase, 
5 or other enzymes, e.g., Stryer, Biochemistry, Chapter 24, W.H. Freeman and Company (1981), or 
by chemical synthesis, e.g., by the phosphoramidite method, the phosphite-triester method, or the 
like. Dye-labels of the invention may be introduced during enzymatic synthesis utilizing labeled 
nucleotide triphosphate monomers as described above, or during chemical synthesis using labeled 
non-nucleoside or nucleoside phosphoramidites, or may be introduced subsequent to synthesis. 

10 Exemplary methods for forming labeled polynucleotides can be found in Sambrook et al., 
Molecular Cloning: A Laboratory Manual, 2nd Ed., Cold Spring Harbor Laboratory Press, NY 
(1 989), US Patents 6,008,379 (Benson et al.), and references cited therein. 

Generally, if a labeled polynucleotide is made using enzymatic synthesis, the following 
procedure may be used. An oligonucleotide primer is annealed to a complementary sequence in a 

15 template DNA strand. A mixture of deoxynucleotide triphosphates (such as dGTP, dATP, dCTP, 
and dTTP) is added, where at least one of the deoxynucleotides contains a nucleobase-dye 
conjugate of the invention. In the presence of a polymerase enzyme, a dye-labeled polynucleotide 
is formed by incorporation of a labeled deoxynucleotide during polymerase-mediated strand 
synthesis. In an alternative enzymatic synthesis method, two primers are used instead of one, one 

20 primer complementary to the + strand and the other complementary to the - strand of the target, the 
polymerase is a thermostable polymerase, and the reaction temperature is cycled between a 
denaturation temperature and an extension temperature, thereby exponentially synthesizing 
(amplifying) a labeled complement to the target sequence by PCR, e.g., PCR Protocols, funis et al. 
eds., Academic Press (1990). 

25 Labeled polynucleotides may be chemically synthesized using any suitable method, such as 

the phosphoramidite method. Detailed descriptions of the chemistry used to form 
polynucleotides by the phosphoramidite method are provided elsewhere, e.g., Caruthers et al., 
U.S. Patents No. 4,458,066 and 4,415,732, Caruthers et al, Genetic Engineering 4: 1-17 (1982), 
Users Manual Model 392 and 394 DNA/RNA Synthesizers, pages 6-1 through 6-22, Applied 

30 Biosystems, Part No. 901237 (1991). Descriptions of the phosphoramidite method and other 
synthesis methods for making polynucleotides containing standard phosphodiester linkages or 
linkage analogs can be found in Gait, Oligonucleotide Synthesis, IRL Press (1990), and S. 
Agrawal, Protocols for Oligonucleotides and Analogs, Methods in Molecular Biology Vol. 20, 
Humana Press, Totowa, NJ (1993). 
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The phosphoramidite method is a preferred method because of its efficient and rapid 
coupling and the stability of the starting materials. The synthesis is performed with a growing 
polynucleotide chain attached to a solid support, so that excess reagents, which are in the liquid 
phase, can be easily removed by filtration, thereby eliminating the need for purification steps 
5 between synthesis cycles. 

III. Methods 

The nucleobase-dye conjugates of the present invention are suited for any method utilizing 
fluorescent detection, particularly methods requiring simultaneous detection of analytes which are 
1 0 not well separated by electrophoresis. The present invention is particularly well suited for detecting 
classes of polynucleotides that have been subjected to a biochemical separation procedure, such as 
electrophoresis. 

In one aspect, the invention provides a method of identifying one or more 
polynucleotide(s). The method utilizes one or more labeled different-sequence polynucleotides, 

15 which may have the same lengths or different lengths, wherein each different-sequence 
polynucleotide contains a unique nucleobase-dye conjugate. The one or more labeled different- 
sequence polynucleotides are separated by electrophoresis to separate different-sequence 
polynucleotides on the basis of size. Each different-sequence polynucleotide can then be 
identified on the basis of its electrophoretic mobility and fluorescence signal. 

20 The polynucleotide(s) may be formed by any appropriate method, with the proviso that 

each different polynucleotide is identifiable on the basis of a unique combination of 
electrophoretic mobility and fluorescence signal. For example, two different polynucleotides 
may contain identical dye moieties but may exhibit different electrophoretic mobilities. 
Alternatively, two different polynucleotides can contain different dye moieties that produce 

25 distinct (spectrally resolvable) fluorescence signals but can exhibit the same electrophoretic 
mobilities. In another example, different polynucleotides can differ in both their fluorescence 
signals and mobilities. 

In one embodiment, the method can be used in a multiplex format in which different 
labeled polynucleotides are formed by reaction with (i) a plurality of different target sequences 
30 and (ii) a plurality of different polynucleotides that are complementary to the target sequences. 
For example, the different polynucleotide can be designed to undergo a change in structure after 
hybridization to their complementary target sequences in a polynucleotide sample, e.g., due to 
modification by enzyme action, thereby producing different labeled polynucleotides having 
unique combinations of mobility and fluorescence to allow identification. Such reactions can be 
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performed simultaneously in a single reaction mixture or can be performed in separate reaction 
mixtures that can be combined prior to electrophoretic separation. Several examplary assay 
formats for producing such labeled polynucleotides are discussed below. 

Sanger-type sequencing involves the synthesis of a DNA strand by a DNA polymerase in 
5 vitro using a single-stranded or double-stranded DNA template whose sequence is to be 
determined or confirmed. Synthesis is initiated at a defined site based on where an oligonucleotide 
primer anneals to the template. The synthesis reaction is terminated by incorporation of a 
nucleotide analog that will not support continued DNA elongation. Exemplary chain-terminating 
nucleotide analogs include the 2',3'-dideoxynucleoside 5'-triphosphates (ddNTPs) which lack the 3'- 

10 OH group necessary for 5' to 3' DNA chain elongation. When proper proportions of dNTPs (2'- 
deoxynucleoside 5 -triphosphates) and one of the four ddNTPs are used, enzyme-catalyzed 
polymerization will be terminated in a fraction of the population of chains at each site where the 
ddNTP is incorporated. If labeled ddNTPs are used for each reaction, a desired sequence read can 
be obtained by detection of the fluorescence signals of the terminated chains during or after 

15 separation by high-resolution electrophoresis. In the chain termination method, dyes of the 
invention can be attached to either sequencing primers or terminator nucleotides. 

In "fragment analysis" or "genetic analysis" methods, labeled polynucleotide fragments can 
be generated through template-directed enzymatic synthesis using labeled primers or nucleotides, 
e.g., by polynucleotide ligation or polymerase-directed primer extension. The resultant fragments 

20 are then subjected to a size-dependent separation process, e.g., electrophoresis or chromatography, 
and the separated fragments are detected, e.g., by laser-induced fluorescence. In a particular 
embodiment, multiple classes of polynucleotides are separated simultaneously and the different 
classes are distinguished by spectrally resolvable labels. 

A fragment analysis method, known as amplified fragment length polymorphisim detection 

25 (AmpFLP), is based on amplified fragment length polymorphisms, i.e., restriction fragment length 
polymorphisms that are amplified by PCR. These amplified fragments of varying size serve as 
linked markers for following mutant genes in family lineages. The closer the amplified fragment is 
to the mutant gene on the chromosome, the higher the linkage correlation. Because genes for many 
genetic disorders have not been identified, these linkage markers serve to help evaluate disease risk 

30 or paternity. In the AmpFLP technique, the polynucleotides may be labeled by using a labeled 
polynucleotide PCR primer, or by utilizing labeled nucleotide triphosphates in the PCR. 

In another fragment analysis method, known as nick translation, one or more unlabeled 
nucleotide subunits in a double-stranded DNA molecule are replaced with labeled subunits. Free 
3'-hydroxyl groups are created within the unlabeled DNA by "nicks" caused by treatment with 
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deoxyribonuclease I (DNAase I). The DNA polymerase I then catalyzes the addition of one or 
more labeled nucleotides to the 3'-hydroxyl of the nick. At the same time, the 5' to 3'-exonuclease 
activity of this enzyme can remove one or more nucleotide subunits from the 5'-phosphoryl 
terminus of the nick. A new nucleotide with a free 3'-OH group is incorporated at the position of 
5 the excised nucleotide, and the nick is shifted along by one nucleotide unit in the 3' direction. This 
3' shift will result in the sequential addition of new labeled nucleotides to the DNA with the 
removal of existing unlabeled nucleotides. The nick-translated polynucleotide is then analyzed 
using a separation process, e.g., electrophoresis. 

Another exemplary fragment analysis method is based on the variable number of tandem 

10 repeats, or VNTRs. VNTRs are regions of double-stranded DNA that contain adjacent multiple 
copies of a particular sequence, with the number of repeating units being variable among different 
members of a population (e.g., of humans). Examples of VNTR loci are pYNZ22, pMCTl 18, and 
Apo B. A subset of VNTR methods are based on the detection of microsatellite repeats, or short 
tandem repeats (STRs), i.e., tandem repeats of DNA characterized by a short (2-4 bases) repeated 

15 sequence. One of the most abundant interspersed repetitive DNA families in humans is the (dC- 
dA)n~(dG-dT)n dinucleotide repeat family (also called the (CA)n dinucleotide repeat family). 
There are thought to be as many as 50,000 to 100,000 (CA)n repeat regions in the human genome, 
typically with 15-30 repeats per block. Many of these repeat regions are polymorphic in length and 
can therefore serve as useful genetic markers. Preferably, in VNTR or STR methods, label is 

20 introduced into the polynucleotide fragments by using a dye-labeled PCR primer. 

In another example, known sometimes as an oligonucleotide ligation assay (OLA), two 
polynucleotides (probe pair) which are complementary to adjacent regions in a target sequence 
are hybridized to the target region of a polynucleotide, to create a nicked duplex structure in 
which the ends of the two polynucleotide abut each other. When the ends of the hybridized 

25 polynucleotide probes match (basepair with) corresponding subunits in the target, the two probes 
can be joined by ligation, e.g., by treatment with ligase. The ligated product is then detected, 
evidencing the presence of the target sequence. In a modification of this approach, known as the 
ligation chain reaction (or ligation amplification reaction), the ligation product acts as a template 
for a second pair of polynucletide probes which are complementary to the ligated product 

30 fromthe first pair. With continued cycles of denaturation, reannealing and ligation in the 

presence of the two complementary pairs of probe, the target sequence is amplified 

exponentially, allowing very small amounts of target sequence to be detected and/or amplified. 

Exemplary conditions for carrying out such processes, including chemical ligation formats, are 

described in US Patents 5,962,223 (Whiteley et al.), 4,988,617 (Landegren et al.), and 5,476,930 
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(Letsinger et al.), and European Patent Publications EP 246864A (Carr et al.), EP 336731 A 
(Wallace), and EP 3246 16A (Royer et al.). 

Conveniently, a fragment analysis method such as any of those discussed above can be 
performed in a multi-probe format, in which a sample is reacted with a plurality of different 
5 polynucleotide probes or probe sets which are each specific for a different target sequence, such 
as different alleles of a genetic locus and/or different loci. The probes are designed to have a 
unique combination of mobility and fluorescence signal, to permit specific detection of the 
individual probes or probe products that are generated in the assay as a result of the presence of 
the different target sequences. 

10 hi the above fragment analysis methods, labeled polynucleotides are preferably separated 

by electrophoretic procedures. Methods for electrophoresis of nucleic acids are well known and are 
described, for example in Rickwood and Hames, Eds., Gel Electrophoresis of Nucleic Acids: A 
Practical Approach, IRL Press Limited, London (1981), Osterman, Methods of Protein and 
Nucleic Acid Research, Vol. 1 Springer-Verlag,. Berlin (1984), Sambrook et al. (1989, supra), P.D. 
- 15 Grossman and J.C. Colburn, Capillary Electrophoresis: Theory and Practice, Academic Press, Inc., 
NY (1992), and U.S. Patents 5,374,527, 5,624,800 and/or 5,552,028. Typically, the electrophoretic 
matrix contains crosslinked or uncrosslinked polyacrylamide having a concentration (weight to 
volume) of between about 2-20 weight percent, and often about 4 to 8 percent. For DNA 
sequencing, the electrophoresis matrix usually includes a denaturing agent such as urea, formamide, 

20 or the like. Detailed exemplary procedures for forming such matrices are given by Maniatis et al., 
"Fractionation of Low Molecular Weight DNA and RNA in Polyacrylamide Gels Containing 98% 
Formamide or 7 M Urea," in Methods in Enzymology, 65: 299-305 (1980), Sambrook et al. (1989, 
supra), and ABI PRISM™ 377 DNA Sequencer User's Manual Rev. A, January 1995, Chapter 2 
(p/n 903433), Applied Biosystems, Foster City, CA). A variety of suitable electrophoresis media 

25 are also commercially available from Applied Biosystems and other vendors, including non- 
crosslinked media, for use with automated instruments such as the Applied Biosystems "3700" and 
"3100" Instruments, by way of example. Optimal electrophoresis conditions, e.g., polymer 
concentration, pH, temperature, voltage, concentration of denaturing agent, employed in a 
particular separation depends on many factors, including the size range of the nucleic acids to be 

30 separated, their base compositions, whether they are single stranded or double stranded, and the 
nature of the polynucleotides for which information is sought by electrophoresis. Accordingly 
application of the invention may require standard preliminary testing to optimize conditions for 
particular separations. 
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During or after electrophoretic separation, the labeled polynucleotides can be detected or 
identified by recording fluorescence signals and migration times (or migration distances) of the 
separated polynucleotides, or by constructing a chart of relative fluorescent and order of migration 
of the polynucleotides (e.g., as an electropherogram). To perform such detection, the labeled 
5 polynucleotides can be illuminated by standard means, e.g. a high intensity mercury vapor lamp, a 
laser, or the like. Typically, the labeled polynucleotides are illuminated by laser light generated by 
a He-Ne gas laser or a solid-state diode laser. The fluorescence signals can then be detected by a 
light-sensitive detector, e.g., a photomultiplier tube, a charged-coupled device, or the like. 
Exemplary electrophoresis detection systems are described elsewhere, e.g., U.S. Patent Nos. 

10 5,543,026, 5,274,240, 4,879,012, 5,091,652 and 4,81 1,218. 

Figs. 19A-19B, 20A-20B, and 21A-21B show various results in support of the present 
invention. The results were obtained for sequencing fragments prepared by template-dependent 
primer extension in the presence of selected dye-labeled terminators following the methodology 
described in US Patent 6,096,875 (see Example 12, except that the primers were not labeled). 

15 Primer extension reactions were performed in the presence of a single type of terminator to 
generate a ladder of extension fragments. The two extension reactions in each study differed 
only in the linkers that were present between the nucleobase and the dye, or between a donor dye 
and acceptor dye. Specifically, the linker in one reaction mixture (A) is non-anionic, whereas 
the linker in the other reaction mixture (B) is an exemplary anionic linker. 

"20 For Figs. 19A and 19B, two primer extension reactions were performed using 7-deaza 

analogs of dideoxyguanosine-5' -triphosphate (ddGTP). Both terminators contained only one 
dye, which is a dibenzoxanthene dye of the type described in US Patent 5,936,087 at Fig. 14, 
wherein R] is CEb-p-CsRi and R2 is propyldiyl, and wherein the C4 and C10 of the benzo 
groups are sulfonated. The dye is linked by the 12-nitrogen atom of the dibenzoxanthene ring 

25 structure to C7 of the 7-deazaguanine ring. For Fig. 19 A, the structure of the L linker is 
B-C=CCH20CH2CH2NHC(0)- j p-C 6 H4CH2-D, wherein B represents C7 of the nucleobase and 
D represents N12 of the dye. For Fig. 19B, the structure of the L linker is 
B-C=CCH 2 OCH 2 CH2NHC(0)CH 2 OP(0)(OH)OCH 2 CH2-NH-C(0)-/7-C6H4CH2-D, wherein B 
and D are as just described. 

30 An arrow in Fig. 19A points to a middle peak flanked on each side by a left peak and a 

right peak. The spacing between the left and middle fragments appears to be about equal to the 
spacing between the middle and right fragments, suggesting that the target sequence contains 
three cytosine subunits (complementary to the ddG terminator) which are separated from each 
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other by an equal number of intervening subunits. However, based on the known sequence of 
the target template (pGEM), the 3 '-terminal subunits of the fragments corresponding to the left 
and middle peaks are known to be separated by three intervening subunits, whereas the 3'- 
terminal subunits of the fragments corresponding to the middle and right peaks are separated by 
5 only one intervening subunit. In other words, the middle peak is four subunits longer than the 
left peak, and the right peak is only two subunits longer than the middle peak. Thus, based on 
the observed profile in Fig. 19 A, a user would have great difficulty in determining the correct 
target sequence. For example, if primer extension had been performed in the presence of four 
spectrally resolvable terminator, the middle peak could co-migrate with a longer or shorter 

10 fragment containing a different terminator, thereby obscuring the correct order of the fragments 
and rendering indeterminate the sequence information. 

In contrast, the profile in Fig. 19B, which was obtained using a dye-labeled nucleobase 
in accordance with the invention, has the expected spacing between the three peaks. The 
separation between the left and middle peaks is twice the separation between the middle and 

15 right peaks, consistent with length differences of four and two subunits, respectively, between 
the left, middle, and right peaks. Clearly, this profile would allow a user to more easily 
determine the correct target sequence. These results demonstrate how the use of an anionic 
linker in a dye-labeled nucleobase can significantly improve the correlation between fragment 
length and electrophoretic mobility of dye-labeled polynucleotides. 

20 Figs. 20A and 20B show electrophoretic profiles obtained using two different 7-deaza 

analogs of dideoxyadenosine-5 '-triphosphate (ddATP). In this study, both terminators contained 
a donor/acceptor dye pair in which Dl is a 6-carboxyfluorescein containing a 5 '-amino methyl 
group, and D2 is the same as the 5-carboxy-4,7-dichlororhodamine dye used in Example 1 
herein. In both conjugates, Dl is linked via the 4'amino methyl group to C5 of the pendent 

25 phenyl ring of D2 using the L2 linker from Example 1. For Fig. 20 A, B and Dl were linked by 
the LI linker from Example 4 (-CsCCH 2 NHC(0)-). For Fig. 20B, B and Dl were linked by an 
LI linker of the form: B-C=CCH 2 NHC(0)- j p-C6H4(S03)CH2NHC(0)-Dl, wherein the 
sulfonate group on the benzene ring is ortho to the aminomethyl group. 

The profile in the left-hand window of Fig. 20A contains a single peak that is separated 

30 from a quartet of four peaks on the right. From the approximately equal spacing between 
adjacent peaks in the quartet, a user would probably conclude that the target sequence contains 
four consecutive thymidine subunits. However, such a conclusion would be erroneous, since the 
target sequence actually contains an adenosine subunit between the third and fourth thymidine 
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subimits. A profile having a significantly improved profile is shown in the left-hand window of 
Fig. 20B. In particular, the third and fourth peaks in the quartet are separated by a distance 
consistent with a length difference of two subunits for those peaks. 

With reference to the right-hand windows of Figs. 20A and 20B, the right-hand window 
5 of Fig. 20A shows a set of closely eluting peaks which are numbered 1, 3, 5, 6, 8, and 10. Based 
on the known template sequence, the correct terminator sequence reading is 5'- 
AC ACAACATA-3 ' (corresponding to a template sequence of 3'-TGTGTTGTAT-5'). 
Unfortunately, peaks 5 and 6 in the right-hand window of Fig. 20A are separated from each 
other by more than one peak interval, so that a user might conclude that the two peaks are 

10 separated by an intervening fragment corresponding to the presence of an intervening subunit in 
the target sequence. (One "peak interval" refers to the average spacing between adjacent peaks 
differing in length by one nucleotide, for a selected region of an electropherogram.) Each peak 
corresponds to a unique-sequence fragment of DNA. 

A profile having significantly improved spacing is shown in the right-hand window of 

15 Fig. 20B. In this case, peaks 5 and 6 are separated by about one peak interval, as expected for 
fragments that differ in length by one nucleotide subunit. Overall, the improvement in spacing 
between peaks 1, 3, 5, 6, 8 and 10 allows more accurate determination of the target sequence. 

Yet another example of the advantages of the invention is illustrated in Figs. 21 A and 
21B. In this study, extension reactions were performed using 7-deaza analogs of ddATP. Both 

20 terminators contained a fluorescein dye (compound 33 from US Patent 6,008,379) which is 
linked via C5 of the pendent phenyl ring to C7 of the 7-deazaadenine ring. For Fig. 21 A, the 
structure of the L linker is B-CsC-p-C 6 H4-C=CCH 2 OCH2CH2NHC(0)-D, wherein B represents 
C7 of the nucleobase and D represents C5 of the pendent phenyl ring of the dye. For Fig. 21B, 
the structure of the L linker is B-C=C-/?-C 6 H4-CeCCH20CH 2 CH 2 NHC(0)- 

25 CH 2 OP(0)(OH)OCH 2 CH 2 NHC(0)-D, wherein B and D are as just described. 

Fig. 21A shows a profile containing four peaks consisting of a singlet on the left, a 
singlet in the middle, and a doublet on the right. From the known target sequence, the singlets 
should be separated from each other by four peak intervals (due to the presence of three 
intervening non-T subunits in the target sequence), the middle singlet and left-hand peak in the 

30 doublet should also be separated by four peak intervals, and the two peaks in the doublet should 
be separated from each other by one peak interval. The profile in Fig. 21 A is problematic 
because the middle singlet is separated from the left-hand member of the doublet by 
approximately 3.5 peak intervals. In the profile shown in Fig. 21B, however, the spacing is 
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more uniform, such that the middle singlet and left-hand peak in the doublet are separated by 
four peak intervals. Thus, the target sequence can be determined more easily using the dye- 
labeled terminator of Fig. 21B in accordance with the invention. 

The results above demonstrate how the use of an anionic linker in a dye-labeled 
nucleobase can significantly improve the correlation between fragment length and 
electrophoretic mobility of dye-labeled polynucleotides. 

In conclusion, the present invention provides conjugate compounds in which a dye and 
nucleobase are linked by an anionic linker, or, in the case of energy transfer dyes, one or more 
linkers located between a dye and a nucleobase and/or between energy transfer dyes are anionic 
linkers. Such linkers can be used in a variety of different forms and may include any of a variety 
of different anionic groups, such that base-pairing characteristics of the nucleobase and 
fluorescent properties of the dye are retained. Compounds of the invention are useful in 
nucleoside and nucleotides that can be incorporated into polynucleotides for detection. In 
particular, polynucleotides containing dye-labeled conjugates of the invention show reduced 
sequence-dependent variations in electrophoretic mobility. Thus, the invention provides 
electrophoretic separation patterns having more even spacing between nearby polynucleotide 
bands, as well as reduced band-compression, resulting in a more consistent and uniform 
relationship between polynucleotide length and electrophoretic mobility. Furthermore, 
nucleoside triphosphates that contain nucleobase-dye conjugates of the invention are good 
substrates for polymerase enzymes which can be used to incorporate such nucleotides into 
polynucleotides to form labeled polynucleotides. This is advantageous in terminator-based 
sequencing methods. When compounds of the invention are incorporated at the 3' ends of 
sequencing fragments, artifacts in electrophoretic mobility are reduced, so that accuracy of base- 
calling can be improved. 

IV. Kits 

The invention also provides kits for performing the various methods of the invention. 
For nucleic acid sequencing, the kit comprises at least one labeled nucleoside triphosphate 
comprising a conjugate described herein. The kit may also include one or more of the following 
components: a 3'-extendable primer, a polymerase enzyme, one or more 3' extendable 
nucleotides which are not labeled with conjugate, and/or a buffering agent. In some 
embodiments, the kit includes at least one labeled nucleoside triphosphate that is nonextendable. 
In other embodiments, the kit comprises four different labeled nucleoside triphosphates which 
are complementary to A, C, T and G, and each of which contains a distinct conjugate as 
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described herein. In yet another embodiment, the labeled nucleoside triphosphates are 
nonextendable. In another embodiment, the labeled nucleoside triphosphates are extendable 
ribonucleoside triphosphates. In another embodiment, the kit comprises at least one labeled, 
nonextendable nucleoside triphosphate comprising a conjugate described herein, and one or 
more of the following components: a 3 -extendable primer, a polymerase enzyme, and/or a 
buffering agent. 

The operation of the invention can be further understood in light of the following non- 
limiting examples which illustrate various aspects of the invention. 

EXAMPLES 
Materials and Methods 
Unless indicated otherwise, all reagents and anhydrous solvents were purchased from 
Aldrich Chemicals. Thin layer chromatography (TLC) analysis was conducted on aluminum 
plates precoated with 250 um layers of silica gel 60-F254. Compounds were located by UV- 
VIS lamp and/or by charring with aqueous K 2 Mn0 4 . Flash column chromatography purification 
was carried out using EM Science silica gel 60 angstrom (230-400 Mesh ASTM). NMR spectra 
were recorded in deuterated solvents (CDC1 3 , CH 3 OD, and D 2 0 with an internal Me 4 Si standard, 
8 0). ! H NMR spectra were recorded at 300 MHz, 13 C NMR spectra at 75.7 MHz, 19 F NMR 
spectra at 282.23 MHz, and 31 P NMR spectra at 121.44 MHz. In all cases, the observed NMR 
spectra were in agreement with the indicated structures. Satisfactory mass spectra were also 
obtained for the reported compounds. 

Anion-exchange high-performance chromatography (AE-HPLC) was performed as 
follows. Column: Aquapore™ AX300, 7 um particle size and 220 x 4.6 mm (PE Applied 
Biosystems). Gradient: 40% acetonitrile : 60% triethylammonium bicarbonate (TEAB, 0.1 M) 
to 40% acetonitrile : 60% TEAB (1.5 M) at 1.5 ml/min over 20 minutes. Detection: UV 
absorbance at 260 nm or Amax of each dye compound. 

Reverse phase high-performance chromatography (RP HPLC) was performed as follows. 
Column: Spheri-5 RP-C18, 5 um particle size, 220 x 4.6 mm (PE Applied Biosystems); 
gradient: 95% triethylammonium acetate (TEAA, 0.1 M) : 5% acetonitrile to 50% acetonitrile : 
50% TEAA at 1.5 ml/min over 20 minutes and then to 100% acetonitrile over 5 minutes. 
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Example 1 

Synthesis of Dye-Nucleotide Conjugate 15 
Methyl glycolate 2 (4.5 eq.) was added to Amino-Link™ 1 (1 eq.) (Connell, C, et al., 
BioTechniques 5:342-348 (1987); US Patent 4,757,151), followed by 4-N,N-dimethyl 
5 aminopyridine (DMAP) (0.1 eq.). The mixture was stirred at ambient temperature for 1 hour. 
After the reaction was complete (TLC analysis), the solution was cooled with ice-bath and then 
treated with a solution of 3-chloroperoxybenzoic acid (4 eq.) in methylene chloride. The ice- 
bath was removed. After 30 minutes, an aqueous solution of NaHSC»3 (10%) was added. The 
mixture was diluted with ethyl acetate. The organic layer was washed with NaHS0 3 (10%), 
10 saturated solution of NaHC0 3 , and dried with Na 2 S0 4 . The crude product was purified by flash 
chromatography to afford compound 3. 

To a solution of compound 3 (36 mM, 1 eq.) in methylethylketone was added Nal (10 
eq.). The mixture solution was heated at reflux for 3 hours. Solvent was removed under 
vacuum to afford crude compound 4 with Nal which was used as such without further 
15 purification. 

Crude compound 4 (1 eq.) was dissolved in 0.3 M solution of LiOH (5 eq.) in a mixed 
solvent H 2 0 : CH 3 OH (1 :3). The mixture was stirred overnight. Solvent was removed to afford 
crude compound 5 which was then dissolved in aqueous Na 2 C0 3 (5%). N-(9- 

~ Fluorenylmethoxy-carbonyloxy)succinimide (FmocOSu, 1.5 eq.) in THF was added in one 

20 portion. The mixture was stirred at ambient temperature for 3 hours. The crude product was 
purified by flash chromatography to afford compound 6. 

Compound 6(1 eq.) was dissolved in ethyl acetate and the resultant solution was washed 
with aqueous solution of HC1 (10%). The organic layer was dried with Na 2 S0 4 . Concentration 
under vacuum gave a yellow oil which was dissolved in anhydrous CH 2 C1 2 . N- 

25 hydroxysuccinimide (4 eq.) was added. The solution was cooled with an ice-bath and then 
treated with dicyclohexyl carbodiimide (DCC, 2 eq.). The ice-bath was then removed, and 
stirring was continued for 2 hours (with TLC analysis). When the reaction was complete, ethyl 
acetate was added and the solution was washed with aqueous solution of HC1 (5%). Removal of 
solvent gave compound 7. 

30 Nucleoside triphosphate 8 (7-(3-amino-l-propynyl)-2\3'-dideoxy-7-deazaadenosine-5'- 

triphosphate - see US Patents 5,047,519 and 5,151,507 by Hobbs et al. for synthesis) in 100 
mM TEA-bicarbonate solution (pH 7.0) was evaporated to dryness. The dried residue was 
suspended in a solution of 250 mM bicarbonate (pH 9.0). A solution of compound 7 in DMSO 
was added. After 1 hour, the reaction mixture was purified by HPLC (AX-300 anion exchange). 
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The product fractions were collected, concentrated to dryness, and purified by RP HPLC (C-18 
reverse phase) to afford compound 9. 

Ammonium hydroxide solution (28-30%) was added to dried compound 9. The solution 
was heated to 55°C for 20 minutes. Concentration under vacuum gave crude compound 10 

5 which was purified by HPLC (C- 1 8 reverse phase). 

NHS-ester dye intermediate 11 was prepared by reacting the fluorenylmethoxy- 
carbonyloxy ester of N-hydroxysuccinimide with the HC1 salt of p-aminomethylbenzoic acid 
(both commercially available) in the presence of base to form the expected N-Fmoc derivative. 
This product was then reacted with N-hydroxysuccinimide in the presence of DCC to form the 

10 NHS ester of the benzoic acid carboxyl group. This NHS ester was then reacted with 4'- 
aminomethyl-6-carboxyfluorescein (M.T. Shipchandler et al. ? Anal. Biochem. 162:89-101 
(1987)) to form the expected product. This product was then reacted with N- 
hydroxysuccinimide in the presence of DCC to produce NHS-ester dye intermediate 11. 

Compound 10 was suspended in a solution of 250 mM bicarbonate (pH 9.0). Then a 

15 solution of dye intermediate 11 in DMSO was added. The reaction mixture was placed in the 
dark at ambient temperature for 2 hours. Purification was done by HPLC (AX-300 anion 

:~ exchange). The recovered dye-labeled compound 12 was dried and then heated at 55°C in 
ammonium hydroxide (28-30%) for 20 minutes. Concentration under vacuum gave crude 
compound 13 which was purified by HPLC (C-18 reverse phase). 

20 Compound 13 was suspended in a solution of 250 mM bicarbonate (pH 9.0). Then a 

solution of dye 14 (e.g., see Example 17 or US Patent 5,847,162 for synthesis) in DMSO was 
added. The reaction mixture was placed in dark at ambient temperature for 2 hours. Purification 
was done by HPLC, AX-300 anion exchange and then C-18 reverse phase to afford pure 
nucleotide-dye compound 15, which contains a phosphate diester moiety within the chain of 

25 linker atoms linking the inner dye to the nucleotide. 
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Example 2 
Alternative Synthesis of Nucleotide 10 
A solution of propargyl amine 16 (3-amino-l-propyne, 3.4 eq.), DMAP (0.1 eq.), and 
methyl glycolate 2(1 eq.) was heated to reflux for 2 hours. The reaction solution was cooled to 
5 ambient temperature and was poured into aqueous solution of HQ (10%). The solution was 
extracted with ethyl acetate. Concentration under vacuum gave desired compound 17 as a 
yellow solid. 

In a manner similar to the method described in Example 1, reaction of compound 17 (1 
eq.) and Amino-Link™ 1 (1 eq.) followed by oxidation with mCPBA gave compound 18 which 

10 was purified by flash column chromatography. Deprotection of the methyl group in 18 (1 eq.) 
using Nal (10 eq.) gave compound 19 after purification by flash column chromatography. 

Nucleoside 20 (7-iodo-7-deazaadenosine, 1 eq.) was reacted with linker synthon 19 (2 
eq.) in the presence of cuprous iodine (0.4 eq.), tetrakis(triphenylphosphine)palladium (0.4 eq.), 
and triethylamine (8 eq.) in N,N-dimethylformamide for 4 hours. The reaction was concentrated 

1 5 and purified by flash column chromatography to afford nucleoside 21 . 

Phosphorous oxychloride (6 eq.) was added to nucleoside 21 (1 eq.) in 
trimethylphosphate at 0°C to form the corresponding dichloromonophosphate 22. The reaction 
mixture was stirred at 0°C for 2 hours after which it was transferred via cannula to another flask 
containing tributylammonium pyrophosphate (12 eq.) and tributylamine (22 eq.) in the presence 

20 of DMF. After another 30 minutes at 0°C, the solution was quenched with TEAB buffer (1M). 
The solution was then stirred overnight at ambient temperature. Purification was carried out by 
HPLC (C-18 reverse phase) to afford nucleoside triphosphate 23. Ammonium hydroxide (28- 
30%) was added to dried nucleoside triphosphate 23, and the resultant solution was stirred at 
ambient temperature for 1 hour. Concentration under vacuum gave nucleotide 10 which was 

25 stored in 0.1 M TEAB solution. Nucleotide 10 can be used in the synthesis of compound 15 as 
described in Example 1 . 
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Example 3 
General Coupling Reactions 
For the Examples below, the following general conditions were used. For reactions 
involving the coupling of an amino group to an N-hydroxysuccinimide-activated carboxyl group 
5 (NHS ester), the compound containing the amino group was suspended in 100 mM TEA- 
bicarbonate (pH-7.0) and evaporated to dryness. The residue was then suspended in a solution 
of 250 mM bicarbonate (pH 9.0), and then a solution of the compound containing the NHS ester 
was added. After 1 hour, the coupling reaction mixture was purified by HPLC (AX-300 anion 
exchange). The product fractions were collected, concentrated to dryness, and purified by 
1 0 HPLC (C-18 reverse phase). 

Removal of trifluoroacetate protecting groups was accomplished by treatment with 
ammonium hydroxide (28-30%) at ambient temperature for one hour. 

Fmoc groups were removed by heating at 55 °C for 20 minutes in the presence of 28- 
30% ammonium hydroxide. 

15 

Example 4 
Synthesis of Dye-Nucleotide Conjugate 33 
Aminobenzoic acid 24 (1 eq.) was dissolved in fuming sulfuric acid and the resultant 
solution was heated in an oil bath at 130°C for 4 hours. The viscous solution was poured into 
20 ice and then was neutralized with a concentrated solution of sodium bicarbonate to a pH of 8 to 
9. The aqueous solvent was removed under vacuum to afford crude compound 25 which was 
then dissolved in aqueous solution of sodium carbonate (5%). To this solution was added 
FmocOSu (Example 1, 1.5 eq.) in THF. The mixture was stirred at ambient temperature for 3 
hours. The resultant crude compound 26 was purified by flash column chromatography. 
25 N-hydroxysuccinimide and N,N-dimethylformamide were added to a slurry solution of 

compound 26 (1 eq.) in methylene chloride. The solution was cooled to 0°C, followed by 
addition of a solution of dicyclohexylcarbodiimide (DCC, 2 eq.) in methylene chloride. The ice- 
bath was removed and stirring was continued for 2 hours. The mixture was diluted with ethyl 
acetate, and the organic portion was washed with aqueous solution of HC1 (5%). Purified NHS 
30 ester 27 was obtained by flash column chromatography. 

Nucleoside triphosphate 8 (Example 1) was reacted with dye-NHS ester 28 (4'-N- 
trifluoroacetylaminomethyl-6-carboxyfluorescein according to procedures generally described in 
Example 3 to form the expected dye-labeled nucleotide 29. Removal of the trifluoroacetyl 
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(TFA) protecting group (Example 3) from conjugate 29 afforded amine compound 30. Reaction 
of compound 30 with NHS ester 27 afforded the expected Fmoc-protected compound 31. 

Following removal of the Fmoc group from compound 31 to afford amine 32, rhodamine 
NHS ester 14 was added to afford nucleo tide-dye compound 33 which contains a sulfonate 
5 substituent in the linker chain between the two dyes. 



Example 5 

Synthesis of Dve-Nucleotide Conjugate 38 
Nucleoside triphosphate 8 was reacted with Fmoc-protected fluorescein NHS ester 11 to 
10 provide the expected product 34. Following removal of the Fmoc group to afford deprotected 
aminomethyl compound 35, Fmoc protected NHS ester 7 was reacted with 35 to afford Fmoc- 
protected compound 36. Removal of the Fmoc group afforded aminoethyl phosphate ester 
compound 37 which was combined with rhodamine NHS ester 14 to afford nucleotide-dye 
compound 38. 



Example 6 
Synthesis of Dve-Nucleotide Conjugate 43 
Nucleotide 10 (Example 1) was combined with TFA-protected fluorescein NHS ester 28 
to afford the expected dye-nucleotide conjugate 39. Following removal of the TFA protecting 
group, the resultant deprotected product 40 was reacted with NHS ester 27 to afford Fmoc- 
protected conjugate 41. Removal of the Fmoc group afforded amine 42, which was reacted with 
rhodamine NHS ester 14 (Example 1) to afford nucleotide-dye conjugate 43 having a phosphate 
diester moiety within the chain of linker atoms between the inner dye and the nucleobase, and a 
sulfonate substituent in the linker between the two dyes. 

Example 7 

Alternative Route to Dye-Nucleotide Conjugate 15 
NHS ester 44 (preparable by reacting dye NHS ester 14 from Example 1 with p- 
aminomethylbenzoic acid 24 from Example 4, followed by activation of the benzoic acid with 
N-hydroxysuccinimide) was combined with amino 40 (Example 6) to afford nucleotide-dye 
conjugate 15 (Example 1). 



Example 8 

Synthesis of Dye-Nucleotide Conjugates 51 and 57 
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8A. Using procedures described in Example 1, 5-(3-amino-l-propynyi)-2',3'- 
dideoxycytidine-5 '-triphosphate 45 (see Hobbs et al., supra, for synthesis) was reacted with 
NHS ester 7 (Example 1) to afford Fmoc-protected intermediate 46. Removal of the Fmoc 
group afforded amine 47, which was then reacted with NHS ester 11 (Example 1) to afford 
5 conjugate 48. Removal of the Fmoc protecting group afforded amine 49, which was then 
reacted with rhodamine NHS ester 50 (e.g., see US Patent 5,847,162, for synthesis) to afford 
conjugate 51. 

8B. The protocol in Example 8 A was carried out using 5-(3-aminoethoxy-l-propynyl)- 
2',3'-dideoxycytidine-5'-triphosphate 52 (e.g., see US Patent 5,821,356 for synthesis) instead of 
10 nucleotide triphosphate 45, to afford conjugate 57. 

Example 9 
Synthesis of Dye-Nucleotide Conjugate 64 
Using the synthetic scheme described in Example 4, conjugate 64 was prepared from 5- 
15 (3-aminoethoxy-l-propynyl)-2',3'-dideoxythymidine-5 '-triphosphate 58 (e.g., see US Patent 
5,821,356 for synthesis) instead of nucleotide 8, dye-NHS ester 28 (supra), linker NHS ester 27 
(supra), and dye-NHS ester 63 (e.g., prepared following methods described in US Patent 
6,080,852) instead of compound 14. 

20 Example 10 

Synthesis of Dye-Nucleotide Conjugate 69 
Using procedures described in Example 5, conjugate 69 was prepared from 5-(3- 
aminoethoxy-l-propynyl)-2',3'-dideoxythymidine-5'-triphosphate 58 (Example 9) instead of 
nucleotide 8, dye NHS ester 11 (supra), phosphate linker synthon 7 (supra), and dye NHS ester 
25 63 (Example 9) instead of compound 14. 

Example 1 1 
Synthesis of Dye-Nucleotide Conjugate 77 
Using procedures described in Example 5, conjugate 77 was prepared from 7-(3- 
30 aminoethoxy-l-propynyl)-2', 3 '-dideoxy-7-deazaguanosine-5' -triphosphate 70 instead of 
nucleotide 8, dye NHS ester 71 (prepared in the same way as compound 11 supra, using the 5- 
carboxyfluorescein instead of the 6-carboxyfluorescein), phosphate linker synthon 7 (supra), and 
dye NHS ester 76 (e.g., see US Patent 5,847,162 for synthesis) instead of compound 14. 



48 



Case No. 4495 II US 



Example 12 

Synthesis of Dye-Nucleotide Conjugates 82 and 85 
12 A. Using methods described in Example 3, sulfonate-containing linker synthon 27 
(supra) was coupled to 7-(3-aminoethoxy-l-propynyl)-3'-fluoro-2',3'-dideoxy-7-deaza- 
5 guanosine-5'-triphosphate 78 (e.g., see US Patent 5,821,356 for synthesis) to afford Fmoc- 
protected product 79. After the Fmoc group was removed, resultant amine 80 was reacted with 
NHS-activated dye 81 (e.g., see US Patent 6,051,719 for synthesis) to produce dye-labeled 
nucleotide 82 containing a sulfonated benzene linker. 

12B. Dye-nucleotide conjugate 85 was prepared by coupling deprotected amine 80 
10 (Example 12 A) to phosphate-containing linker synthon 7 (supra) to afford Fmoc-protected 
product 83. After removal of the Fmoc group, resultant amine 84 was coupled to NHS-activated 
dye 81 (Example 12 A) to afford dye-nucleotide conjugate 85 having a linking group containing 
both a sulfonate group and a phosphate group. 

15 Example 13 

Synthesis of Dye-Nucleotide Conjugate 95 
Using procedures described in Example 1, methyl 4-(hydroxymethyl) benzoate 86 (1.3 
eq.), Ammo-Link™ 1 (Example 1, 1 eq.), and DMAP (0.1 eq.) were reacted together, followed 
by oxidation with mCPBA (1.5 eq.) to afford phosphotriester 87. Removal of the methyl group 

20 from the phosphotriester group using Nal (10 eq.) afforded phosphodiester 88, which was 
subsequently treated with LiOH (6 eq.) to remove the methyl and trifluoroacetyl protecting 
groups, yielding amino acid phosphate diester 89. Protection of the amino group with FmocOSu 
(Example 1) gave compound 90. NHS ester 91 was obtained by treating compound 90 with 
DCC (1.3 eq.) and N-hydroxysuccinimide (3 eq.). NHS ester 91 was added to nucleotide 78 

25 (Example 12 A) to form Fmoc-protected compound 92. After removal of the Fmoc group, 
resultant amine compound 93 was coupled to dye 81 (Example 12 A) to afford dye-nucleotide 
conjugate 95 containing a phosphodiester within the linker chain. 

Example 14 

30 Synthesis of Dye-Nucleotide Conjugate 106 

Using procedures described in Example 2, propargyl alcohol 96 (3-propyn-l-ol, 1.5 eq.), 
Amino-Link™ 1 (Example 1, 1 eq.), and DMAP (0.1 eq.) were reacted together, followed by 
oxidation with mCPBA (1.5 eq.) to afford phosphotriester 97. Removal of the methyl group 
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from the phosphotriester group using Nal (10 eq.) afforded propargyl-trifluoroacetylaminoethyl 
phosphodiester 98. 

Phosphodiester 98 (1.5 eq.) was reacted with 7-iodo-7-deazaadenosine 20 (1 eq.) in the 
presence of Cul (0.4 eq.), Pd[PPh 3 ] 4 (0.4 eq.), and triethylamine (8 eq.) in DMF to afford 
5 nucleoside product 99. 

Phosphorous oxychloride (6 eq.) was reacted with nucleoside 99 (1 eq.) to form 
dichlorophosphate intermediate 100. This intermediate was treated with tributylammonium 
pyrophosphate (12 eq.) and tributylamine (22 eq.), followed by hydrolysis with TEAB buffer (1 
M), to form nucleotide 5' triphosphate 101. Removal of the TFA group was achieved by 
10 ammonium hydroxide to afford nucleotide amine 102. 

Dye NHS ester 11 (Example 1) was added to nucleotide amine 102 in NaHCC>3 buffer to 
afford dye-nucleotide 104. After the Fmoc group was removed, the resultant amine compound 
105 was coupled to dye NHS ester 14 (Example 1) to afford dye-nucleotide conjugate 106. 

15 Example 15 

Synthesis of Dve-Nucleotide Conjugate 123 
Triethylsilylacetylene 108 (3 eq.) was coupled to 4-iodophenol 107 (1 eq.) in the 
presence of Cul (0.05 eq.), Pd[PPh 3 ] 4 (0.05 eq.), and triethylamine (2 eq.) in DMF. The mixture 
was stirred at ambient temperature for 5 hours after which it was concentrated under vacuum to 
20 afford a crude black oil. The oil was purified by flash column chromatography to afford pure p- 
(triethylsilyl)ethynyl phenol 109. 

To a solution of compound 109 in dichloromethane were added triflic anhydride 
(trifluoromethanesulfonic anhydride, 1.2 eq.) and triethylamine (1.2 eq.) at -40 °C. After 30 
minutes, the reaction was quenched with water and the solution was extracted with 
25 dichloromethane. Purification was achieved by flash column chromatography to afford 
trifluoromethanesulfonate 111. 

Propargyl alcohol 110 (1.1 eq.) was coupled to compound 111 (1 eq.) in the presence of 
Cul (0.1 eq.), Pd[PPh 3 ] 4 (0.05 eq.), and triethylamine (2 eq.) in DMF. The mixture was stirred at 
60°C for 7 hours and then concentrated under vacuum. Purification by flash column 
30 chromatography gave alcohol 112. 

Alcohol 112 (1 eq.), Amino-Link™ 1 (1 eq.), and DMAP (0.1 eq.) were reacted together 
using procedures described above, followed by oxidation with mCPBA (1.2 eq.) to produce 
phosphotriester 113. Removal of the methyl group from the phosphate triester group using Nal 
(10 eq.) afforded phosphodiester 114. 
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To a solution of compound 114 in tetrahydrofuran (THF) at 0°C was added tetra- 
butylammonium fluoride (1.5 eq.). After 30 minutes, the solution was quenched with aqueous 
ammonium chloride solution (10%) and extracted with ethyl acetate. Purification was achieved 
by flash column chromatography to afford phosphate-containing linker synthon 115. 
5 Compound 115 (2 eq.) was coupled to nucleoside 116 (7-iodo-3'-fiuoro-2',3'-dideoxy-7- 

deazaguanosine, 1 eq.) (e.g., see Hobbs et al., supra, for synthesis) in the presence of Cul (0.4 
eq.), Pd[PPh 3 ] 4 (0.4 eq.), and triethylamine (10 eq.) in DMF to afford nucleoside 117. 

Phosphorous oxychloride (6 eq.) was reacted with nucleoside 117 (1 eq.) to form 
intermediate 118 which was treated with tributylammonium pyrophosphate (12 eq.) and 
10 tributylamine (22 eq.), followed by hydrolysis with TEAB buffer (1 M) to form nucleotide 119. 
Removal of the TFA group was achieved using ammonium hydroxide to afford amine 120. 

Dye 71 (supra) was added to nucleotide amine 120 in NaHC0 3 buffer to afford dye- 
labeled nucleotide 122 (in these examples, there is no compound 121). After the Fmoc group 
2 was removed, the resultant amine compound 123 was coupled with dye 76 (Example 11) to 
1 5 afford dye-nucleotide conjugate 124. 

Example 16 

Nucleotide-Dve Conjugate 137 with Phosphonate-Containing Linker 
Lithiated tert-butyl acetate 125, prepared from tert-butyl acetate and lithio N,N- 

20 diisopropylamine (prepared according to M.W. Rathke et al., J. Amer. Chem. Soc. 95:3050 
(1973)) in hexane, is added dropwise to a hexane solution of 2-cyanoethyl 
diisopropylchlorophosphoramidite 126 (Aldrich Chemical Company) under an inert atmosphere. 
The reaction mixture is washed free from salts with water. The resultant organic layer is dried 
over anhydrous sodium sulfate, filtered, reduced in volume, and phosphoramidite 127 is isolated 

25 by column chromatography on silica gel by elution with hexane and dichloromethane. 

Phosphoramidite 127 is dissolved in dry acetonitrile and placed under an inert 
atmosphere. To this solution is added an acetonitrile solution of 1.2 equivalents of 2- 
(trifluoroacetamido)ethanol 128 and 1.2 equivalents of tetrazole. The product phosphonite 129 
is isolated by column chromatography on silica gel by elution with hexane and dichloromethane. 

30 Phosphonite 129 is dissolved in dry dichloromethane. To the solution is added 1.5 

equivalents of tert-butyl hydroperoxide (TBHP) in dichloromethane. After complete reaction, 
the excess TBHP is removed from the reaction mixture by washing with water. The resultant 
organic layer is dried over anhydrous sodium sulfate, filtered, reduced in volume, and the 
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phosphonate diester product 130 is isolated by column chromatography on silica gel by elution 
with hexane and dichloromethane. 

Phosphonate diester 130 is dissolved in dichloromethane, and 2 equivalents of tri- 
fluoroacetic acid (TFA) are added. After the reaction is complete, the excess TFA is removed 
5 by washing with water. The resultant organic layer is dried over anhydrous sodium sulfate, 
filtered, reduced in volume, and the phosphonate diester carboxylic acid 131 is isolated by 
column chromatography on silica gel by elution with methanol and acetic acid in 
dichloromethane. 

Phosphonate diester 131 is dissolved in ethyl acetate. To the solution is added N- 
10 hydroxysuccinimide (NHS, 2 eq.) and dicyclohexylcarbodiimide (DCC, 1.2 eq.). After the 
reaction is complete, dicyclohexylurea is removed by filtration, and excess NHS is removed by 
washing with water. The resultant organic layer is dried over anhydrous sodium sulfate, filtered, 
reduced in volume, and the phosphonate NHS ester 132 is isolated by column chromatography 
on silica gel by elution with hexane and dichloromethane. 
15 Phosphonate NHS ester 132 is added to nucleotide 8 (Example 1) in NaHC0 3 buffer to 

afford nucleotide 133. After the cyanoethyl and TFA group are removed by treatment with 
ammonium hydroxide, the resultant nucleotide amine 134 is coupled to dye-NHS ester 11 
(Example 1) to afford the expected dye-labeled nucleotide 135. The Fmoc group is removed, 
and the amine product 136 is coupled with dye-NHS ester 14 (Example 1) to afford dye- 
20 nucleotide conjugate 137. 

Example 17 
Synthesis of Dve-NHS Ester 14 
Bicyclic amine t-Boc ester 141 (12.8 gm, 47 mmole, US Patent No. 5,688,808), 1- 
25 bromo-3-chloropropane (29.3 gm, 187 mmole), sodium iodide (56.4 gm, 376 mmole) and 
sodium bicarbonate (7.9 gm, 94 mmole) were refluxed together in 150 ml CH3CN for 18 hours. 
The mixture was cooled to room temperature, filtered, and evaporated. The filter cake was 
washed with 300 ml hexane which was combined with the filtrate and washed with 2 x 50 ml 
water and 50 ml saturated NaCl, dried over MgS0 4 , filtered, and concentrated under vacuum. 
30 The product was purified by chromatography on silica gel, eluting with hexane/ethyl acetate : 
20/1, to give tricyclic amine pivalate ester 142 (pale yellow oil, 9.5 gm, 30 mmole, 64% yield). 
The ester group of 142 was hydro lyzed in a solution of lithium hydroxide monohydrate (2.6 gm, 
60 mmole) in 15 ml water and 120 ml methanol. After stirring for one hour at room 
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temperature, the mixture was concentrated under vacuum, dissolved in 30 ml 1M HC1, and 
extracted with 3 x 100 ml of diethylether. The combined ether extracts were washed with 50 ml 
of 200 mM pH 7 phosphate buffer, dried over MgS0 4 , filtered and concentrated under vacuum 
to give tricyclic amine 143 as a brown solid. Tricyclic amine 143 and 3,6-dichloro, 4- 
5 isopropylcarboxylate phthalic anhydride 144 were refluxed in toluene to give Friedel-Craft 
acylation product ketone 145 (Abs. max 400 nm). Cyclization of 145 with 143 in phosphoryl 
trichloride (as activating agent) and chloroform at reflux gave dye isopropyl carboxylic ester 146 
as a mixture of isopropylcarboxylate regioisomers. After cleavage of the isopropyl group, dye 
carboxylic acid 147 was converted to NHS-rhodamine dye 14 (which can be used in Example 
10 1). 

Example 18 
Synthesis of Dve-Nucleotide Conjugate 211 
To a solution of 4-bromoisophthalic acid 201 (lOg, 40.8mmol) in methanol (150mL) was 
15 added concentrated sulfuric acid (3.5mL). The solution mixture was heated at reflux 
temperature for 24 hours. After the reaction was complete, the methanol solvent was removed 
under reduced pressure. The residue was made basic with saturated sodium bicarbonate 
(NaHCOs). The solution was extracted with ethyl acetate. The extract was washed with water 
and dried over sodium sulfate (Na 2 S0 4 ). Removal of solvent gave oil, which solidified at 
20 ambient temperature. The solid was titurated twice with 20mL of methanol and hexane (1 :3) to 
give 8g (90%) of 4-bromoisophthalic acid dimethyl ester 202: ! H NMR (400 MHz, CDC1 3 ) 6 
3.93 (s, 3H), 3.93 (s, 3H), 7.74 (d, J = 8 Hz, 1H), 7.95 (dd, J = 2 Hz, J = 8 Hz, 1H), 8.43 (d, J = 2 
Hz, 1H). 

To a solution of 4-bromoisophthalic acid dimethyl ester 202 (2.95g, 10.8mmol) in 
25 anhydrous DMF (50ml), Copper(I) cyanide (1.20g, 13.5mmol) was added in one portion. The 
slurry solution was heated at reflux temperature for one hour. When the solution was cooled to 
ambient temperature, it was poured into 300mL of ammonium chloride solution (10%) at 0 °C. 
White precipitate was formed. The slurry solution was stirred at ambient temperature for 30 
minutes. Then it was extracted with ethyl acetate. The combined extracts were dried over 
30 sodium sulfate. Removal of solvent gave white solid which was washed with cold methanol to 
give 1.89g (80%) of 4-cyanoisophthalic acid dimethyl ester 203: *H NMR (400 MHz, CDC1 3 ) 6 
3.95 (s, 3H), 4.05 (s, 3H), 7.91 (d, J = 8 Hz, 1H), 8.30 (dd, J = 1 .7 Hz, J = 8 Hz, 1H), 8.77 (d, J = 
1.7 Hz, 1H). 
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4-Cyanoisophthalic acid dimethyl ester 203 (1.70g, 7.76mmol) was dissolved in 
methanol (lOOmL) and then treated with an aqueous solution of lithium hydroxide (2.0g, 
46.5mmol) in 50mL of water. The basic solution was stirred at ambient temperature for 1.5 
hour. It was then poured into a 10% HC1 solution. The solution was extracted with ethyl acetate 
5 and the combined extracts were dried over Na 2 S04. Removal of solvent gave quantitative yield 
of 4-cyanoisophthalic acid 204: ! H NMR (400 MHz, CD 3 OD) 5 8.01 (d, J = 8 Hz, 1H), 8.33 (dd, 
J = 1.7 Hz, J = 8 Hz, 1H), 8.73 (d, J = 1.7 Hz, 1H). 

A pressure flask was charged with 4-cyanoisophthalic acid 204 (1.40g, 7.33mmol), 
ethanol (lOOmL), Pd/C (10%) (0.5Qmg), and concentrated HC1 (1.5mL). The mixture solution 

10 was hydrogenated at 50 PSI overnight. The solid impurities were removed by filtration. 
Removal of solvent gave quantitative yield of 4-aminomethylisophthalic acid 205: *H NMR 
(400 MHz, CD 3 OD) 5 4.46 (s, 2H), 7.69 (d, J = 8 Hz, 1H), 8.26 (dd, J = 2 Hz, J = 8 Hz, 1H), 
8.76 (d, J= 1.7 Hz, 1H). 

A round bottom flask was charged with 4-aminomethylisophthalic acid 205, ethanol 

1 5 (20mL), methanol (20mL), excess ethyl trifluoroacetate (20mL), and triethylamine (lOmL). The 
mixture solution was stirred at ambient temperature for 1 hour. It was then poured into a 
hydrochloric acid solution (10%). The solution was extracted with ethyl acetate. The combined 
extracts were dried over Na 2 S0 4 . Removal of solvent gave compound 206 (1 .8g): ! H NMR (400 

~ MHz , CD3OD) 5 4.9 (s, 2H), 7.51 (d, J = 8 Hz, 1H), 8.17 (dd, J = 2 Hz, J = 8 Hz, 1H), 8.65 (d, J 

20 =2 Hz, 1H). 

A 100 mL, round bottom flask was charged with crude compound 206 (lg, 3.43mmol), 
anhydrous DMF (30mL), 0-(N-succinimidyl)-N,N,N',N'-tetramethyluronium tetrafluoroborate 
(lg, 3.43mmol), and diisopropylethylamine (1.3g, 10.3mmol). The reaction solution was stirred 
at ambient temperature for 30 minutes. Solvent was removed under reduced pressure. The 

25 crude compound was added 5% HC1 solution and was extracted with ethyl acetate. The 
combined organic extracts were dried over Na 2 S0 4 . Purification was achieved by flash column 
chromatography on silica gel with eluent CH 2 Cl 2 :CH 3 OH (10:1) followed by CH 2 C1 2 :CH 3 0H 
(5:1) to afford 500mg (38%) of activated ester 207: H'lSIMR (400 MHz, CDC1 3 + 10% CD3OD) 
8 2.93 (s, 4H), 4.80 (s, 2H), ), 7.60 (d, J = 8 Hz, 1H), 8.17 (dd, J = 2 Hz, J - 8 Hz, 1H), 8.7.0 (d, 

30 J = 2 Hz, 1H); 19 F NMR -178ppm (s). 

Compound 208 (which is the same as compound 58 noted in Example 9) (1 equiv) was 
dissolved in a minimum amount of formamide and a solution of linker synthon 207 (7 equiv) in 
DMSO (1 mg of 207 per 5 uL of DMSO) was added, followed by diisopropylethylamine (20 
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equiv). The coupling reaction was complete in one hour at ambient temperature. Purification 
was carried out by HPLC (AX-300 anion exchange). The recovered compound was dried under 
reduced pressure and purified by HPLC (C-18 reverse phase). The compound was dried under 
reduced pressure and then heated in ammonium hydroxide (28-30%) for 15 minutes at 55°C. 
5 Then it was dried and purified by HPLC (C-18 reverse phase) to give nucleotide 209. 

The following can be used as a general procedure to produce dye conjugates from 
compound 209 and an activated ester, such as an NHS ester of a dye (referred to as "NHS dye" 
in Fig. 22B). Compound 209 (1 equiv) is suspended in a minimum amount of a solution of 250 
mM bicarbonate (pH 9.0), and a solution of dye NHS ester 210 in DMSO (3 equiv, 1 mg of dye 
10 NHS ester per 12 uL of DMSO) is then added. The reaction mixture is placed in the dark at 
ambient temperature for 1 hour. The product can be purified by anion exchange HPLC (AX- 
300) and then C-18 reverse phase HPLC to give pure dye-labeled nucleotide 211. 



All publications and patent applications mentioned herein are hereby incorporated by 
15 reference as if each publication or patent application was specifically and individually indicated 
to be incorporated by reference. 

Although the invention has been described with reference to certain illustrative 
embodiments and examples, it will be appreciated that various modifications and variations can 
be made without departing from the scope and spirit of the invention. 
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